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FIELD OF THE INVENTION 

The invention relates to improved adenovirus vectors, and more specifically, 
adenovirus vectors useful for gene therapy. 

BACKGROUND 

Adenoviruses (Ad) are double-stranded DNA viruses. The genome of adenoviruses 
(-36 kb) is complex and contains over 50 open reading frames (ORFs). These ORFs are 
overlapping and genes encoding one protein are often embedded within genes coding for other 
Ad proteins. Expression of Ad genes is divided into an early and a late phase. Earlv genes 
are those transcribed prior to replication of the genome while late genes are transcribed after 
replication. The early genes comprise El a. Elb. E2a. E2b. E3 and E4. The E la sene 
products are involved in transcriptional regulation: the Elb gene products are involved in the 
shut-off of host cell functions and mRNA transport. E2a encodes the a DNA-binding protein 
(DBP): E2b encodes the viral DNA polymerase and preterminal protein (pTP). The E3 gene 
products are not essential for viral growth in cell culture. The E4 region encodes regulatory 
protein involved in transcriptional and post-transcriptional regulation of viral gene expression: 
a subset of the E4 proteins are essential for viral growth. The products of the late genes (e.g.. 
LI -5) are predominantly components of the virion as well as proteins involved in the 
assembly of virions. The VA genes produce VA RNAs which block the host cell from 
shutting down viral protein synthesis. 

Adenoviruses or Ad vectors have been exploited for the delivery of foreign genes to 
cells for a number of reasons including the fact that Ad vectors have been shown to be highly 
effective for the transfer of genes into a wide variety of tissues in vivo and the fact that Ad 
infects both dividing and non-dividing cells: a number of tissues which are targets for gene 
therapy comprise largely non-dividing cells. 

The current generation of Ad vectors suffer from a number of limitations which 
preclude their widespread clinical use including: 1) immune detection and elimination of cells 
infected with Ad vectors. 2) a limited carrying capacity (about 8.5 kb) for the insertion of 
foreign genes and regulatory elements, and 3) low-level expression of Ad genes in cells 
infected with recombinant Ad vectors (generally, the expression of Ad proteins is toxic to 
cells). 
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The latter problem was thought to be solved by using vectors containing deletions in 
the El region of the Ad genome (El gene products are required for viral gene expression and 
replication). However, even with such vectors, low-level expression of Ad genes is observed. 
It is now thought that most mammalian cells contain El -like factors which can substitute for 
5 the missing Ad El proteins and permit expression of Ad genes remaining on the El deleted 
vectors. 

What is needed is an approach that overcomes the problem of low level expression of 
Ad genes. Such an approach needs to ensure that adenovirus vectors are safe and non- 
immunogenic. 

10 

SUMMARY OF THE INVENTION 

The present invention contemplates two approaches to improving adenovirus vectors. 
The first approach generally contemplates a recombinant plasmid. together with a helper 
adenovirus, in a packaging cell line. The helper adenovirus is rendered safe by utilization of 
15 loxP sequences. In the second approach, "damaged" adenoviruses are employed. While the 
"damaged" adenovirus is capable of self-propagation in a packaging cell line, it is not capable 
of expressing certain genes (e.g.. the DNA polymerase gene and/or the adenovirus preterminal 
protein gene). 

In one embodiment of the first approach, the present invention contemplates a 
20 recombinant plasmid. comprising in operable combination: a) a plasmid backbone. 

comprising an origin of replication, an antibiotic resistance gene and a eukaryotic promoter 
element: b) the left and right inverted terminal repeats (ITRs) of adenovirus, said ITRs each 
having a 5* and a 3' end and arranged in a tail to tail orientation on said plasmid backbone; 
c) the adenovirus packaging sequence, said packaging sequence having a 5* and a 3' end and 
25 linked to one of said ITRs; and d) a first gerie of interest operably linked to said promoter 
element. 

While it is not intended that the present invention be limited by the precise size of the 
plasmid. it is generally desirable that the recombinant plasmid have a total size of between 27 
and 40 kilobase pairs. It is preferred that the total size of the DNA packaged into an EAM 
30 derived from these recombinant plasmids is about the length of the wild-type adenovirus 

genome (-36 kb). It is well known in the art that DNA representing about 105% of the wild- 
type length may be packaged into a viral particle: thus the EAM derived from recombinant 
plasmid may contain DNA whose length exceeds by -105% the size of the wild-type genome. 

. l - 
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The size of the recombinant plasmid may be adjusted using reporter genes and genes of 
interest having various sizes ( including the use of different sizes of introns within these genes) 
as well as through the use of irrelevant or non-coding DNA fragment which act as "stuffer" 
fragments (c^.. portions of bacteriophage genomes). 
5 in one embodiment of the recombinant plasmid. said 5" end of said packaging 

sequence is linked to said 3" end of said left ITR. In this embodiment, said first gene of 
interest is linked to said V end of said packaging sequence. It is not intended that the present 
invention be limited by the nature of the gene of interest: a variety of genes (including both 
cDNA and genomic forms) are contemplated: any gene having therapeutic value may be 

10 inserted into the recombinant plasmids of the present invention. For example, the transfer of 
the adenosine deaminase (ADA) gene is useful for the treatment of ADA- patients: the 
transfer of the CFTR gene is useful for the treatment of cystitic fibrosis. A wide variety of 
diseases are known to be due to a defect in a single gene. The plasmids. vectors and EAMs 
of the present invention are useful for the transfer of a non-mutated form of a gene which is 

1 5 mutated in a patient thereby resulting in disease. The present invention is illustrated using 
recombinant plasmids capable of generating encapsidated adenovirus minichromosomes 
(EAMs) containing the dystrophin cDNA gene (the cDNA form of this gene is preferred due 
to the large size of this gene): the dystrophin gene is non-functional in muscular dystrophy 
(MD) patients. However, the present invention is not limited toward the use of the dystrophin 

20 gene for treatment of MD: the use of the utrophin (also called the dystrophin related protein) 
gene is also contemplated for gene therapy for the treatment of MD [Tinsley et al. (1993) 
Curr. Opin. Genet. Dev. 3:484 and (1992) Nature 360:591]; the utrophin gene protein has 
been reported to be capable of functionally substituting for the dystrophin gene [Tinsley and 
Davies (1993) Neuromusc. Disord. 3:539]. As the utrophin gene product is expressed in the 

25 muscle of muscular dystrophy patients, no immune response would be directed against the 
utrophin gene product expressed in cells of a host (including a human) containing the 
recombinant plasmids. Ad vectors or EAMs of the present invention. While the present 
invention is illustrated using plasmids containing the dystrophin gene, the plasmids. Ad 
vectors and EAMs of the present invention have broad application for the transfer of any gene 

30 whose gene product is missing or altered in activity in cells. 

Embodiments are contemplated wherein the recombinant plasmid further comprises a 
second gene of interest. In one embodiment, said second gene of interest is linked to said 3' 
end ot said right ITR. In one embodiment, said second gene of interest is a reporter gene. A 
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variety of reporter genes are contemplated, including but not limited to £ coli p-«alactosidase 
gene, the human placental alkaline phosphatase gene, the green tluorescent protein gene and 
the chloramphenicol acetyltransferase gene. 

As mentioned above, the first approach also involves the use of a helper adenovirus in 
combination with the recombinant plasmid. In one embodiment, the present invention 
contemplates a helper adenovirus comprising i) first and a second loxP sequences, and ii) the 
adenovirus packaging sequence, said packaging sequence having a 5" and a 3* end. It is 
preferred that said first loxP sequence is linked to the 5* end of said packagine sequence and 
said second loxP sequence is linked to said 3' end of said packaging sequence. In one 
embodiment, the helper virus comprises at least one adenovirus gene coding reeion. 

The present invention contemplates a mammalian cell line containing the above- 
described recombinant plasmid and the above-described helper virus. It is preferred that said 
cell line is a 293-derived cell line. Specifically, in one embodiment, the present invention 
contemplates a mammalian cell line, comprising: a) a recombinant plasmid. comprising, in 
operable combination: i) a plasmid backbone, comprising an origin of replication, an 
antibiotic resistance gene and a eukaryotic promoter element, ii) the left and right inverted 
terminal repeats (ITRs) of adenovirus, said ITRs each having a 5* and a 3' end and arranged 
in a tail to tail orientation on said plasmid backbone, iii) the adenovirus packaging sequence, 
said packaging sequence having a 5* and a 3" end and linked to one of said ITRs. and iv) a 
first gene of interest operably linked to said promoter element: and b) a helper adenovirus 
comprising i) first and a second loxP sequences, and ii) the adenovirus packaging sequence, 
said packaging sequence having a 5" and a 3" end. As noted previously, said helper can 
further comprise at least one adenovirus gene coding region. 

Overall, the first approach allows for a method of producing an adenovirus 
minichromosome. In one embodiment, this method comprises: A) providing a mammalian 
cell line containing: a) a recombinant plasmid. comprising, in operable combination, i) a 
plasmid backbone, comprising an origin of replication, an antibiotic resistance gene and a 
eukaryotic promoter element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3' end and arranged in a tail to tail orientation 
on said plasmid backbone, iii) the adenovirus packaging sequence, said packaging sequence 
having a 5' and a 3* end and linked to one of the ITRs. and iv) a first gene of interest 
operably linked to said promoter element; and b) a helper adenovirus comprising i) first and a 
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second luxP sequences, ii) at least one adenovirus gene coding region, and iii) the adenovirus 
packaging sequence, said packaging sequence having a 5" and a 3" end: and 
B) growing said cell line under conditions such that said adenovirus gene codins region is 
expressed and said recombinant plasmid directs the production of at least one adenoviral 
minichromosome. It is desired that said adenovirus minichromosome is encapsidated. 

In one embodiment, the present invention contemplates recovering said encapsidated 
adenovirus minichromosome and. in turn, purifying said recovered encapsidated adenovirus 
minichromosome. Thereafter, said purified encapsidated adenovirus minichromosome can be 
administered to a host (e.g., a mammal). Human therapy is thereby contemplated. 

It is not intended that the present invention be limited by the nature of the 
administration of said minichromosomes. All types of administration are contemplated, 
including direct injection (intra-muscular. intravenous, subcutaneous, etc.). inhalation, etc. 

As noted above, the present invention contemplates a second approaches to improving 
adenovirus vectors. In the second approach, "damaged" adenoviruses are employed. In one 
embodiment, the present invention contemplates a recombinant adenovirus comprising the 
adenovirus E2b region having a deletion, said adenovirus capable of self-propagation in a 
packaging cell line and said E2b region comprising the DN.A polymerase gene and the 
adenovirus preterminal protein gene. In this embodiment, said deletion can be within the 
adenovirus DNA polymerase gene. Alternatively, said deletion is within the adenovirus 
preterminal protein gene. Finally, the present invention also contemplates embodiments 
wherein said deletion is within the adenovirus DNA polymerase and preterminal protein 



uenes. 



The present invention further provides cell lines capable of supporting the propagation 
of. Ad virus containing deletions within the E2b region. In one embodiment the invention 

25 provides a mammalian cell line stably and constitutively expressing the adenovirus El gene 
products and the adenovirus DNA polymerase. In one embodiment, these cell lines comprise 
a recombinant adenovirus comprising a deletion within the E2b region, this E2b-deleted 
recombinant adenovirus being capable of self-propagation in the cell line. The present 
invention is not limited by the nature of the deletion within the E2b region. In one 

30 embodiment, the deletion is within the adenoviral DNA polymerase gene. 

The present invention provides cells lines stably expressing El proteins and the 
adenoviral DNA polymerase, wherein the genome of the cell line contains a nucleotide 
sequence encoding adenovirus DNA polymerase operably linked to a heterologous promoter. 
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In a particularly preferred embodiment, the cell line is selected from the group consisting of 
ihe B-6. B-9. C-l. C-4. C-7. C-13. and C-14 cell lines. 

The present invention further provides cell lines which further constitutive!)- express 
the adenovirus preterminal protein (pTP) gene product (in addition to El proteins and DNA 
5 polymerase). In one embodiment, these pTP-expressing cell lines comprise a recombinant 
adenovirus comprising a deletion within the E2b region, the recombinant adenovirus being 
capable of self-propagation in the pTP-expressing cell line. In a preferred embodiment the 
deletion within the E2b region comprises a deletion within the adenoviral preterminal protein 
gene. In another preferred embodiment, the deletion within the E2b region comprises a 
10 deletion within the adenoviral (Ad) DNA polymerase and preterminal protein genes. 

In a preferred embodiment, the cell lines coexpressing pTP and Ad DNA polymerase, 
contain within their genome, a nucleotide sequence encoding adenovirus preterminal protein 
operably linked to a heterologous promoter. In the invention is not limited by the nature of 
the heterologous promoter chosen. The art knows well how to select a suitable heterologous 
15 promoter to achieve expression in the desired host cell 293 cells or derivative thereof). 
In a particularly preferred embodiment, the pTP- and Ad poiymerase-expressing cell line is 
selected from the group consisting of the C-l, C-4. C-7. C-13. and C-14 cell lines. 

The present invention provides a method of producing infectious recombinant 
adenovirus particles containing an adenoviral genome containing a deletion within the E2b 
20 region, comprising: a) providing: i) a mammalian cell line stably and constitutively 
expressing the adenovirus El gene products and the adenovirus DNA polymerase: ii) a 
recombinant adenovirus comprising a deletion within the E2b region, the recombinant 
adenovirus being capable of self-propagation in said cell line: b) introducing the recombinant 
adenovirus into the cell line under conditions such that the recombinant adenovirus is 
25 propagated to form infectious recombinant adenovirus particles: and c) recovering the 

infectious recombinant adenovirus particles. In a preferred embodiment, the method further 
comprises d) purifying the recovered infectious recombinant adenovirus particles. In yet 
another preferred embodiment, the method further comprises e) administering the purified 
recombinant adenovirus particles to a host which is preferably a mammal and most preferably 
30 a human. 

In another preferred embodiment the mammalian cell line employed in the above 
method further constitutively expresses the adenovirus preterminal protein. 



- 6 - 



WO 98/1 7783 PCT/US97/1 954 1 

The present invention further provides a recombinant plasmid capable of replicating in 
a bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the polymerase gene, the deletion resulting in reduced polymerase activity. The 
present invention is not limited by the specific deletion employed to reduce polymerase 
5 activity. In a preferred embodiment, the deletion comprises a deletion of nucleotides 8772 to 
9385 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid has the 
designation pApol. In another preferred embodiment, the recombinant plasmid has the 
designation pBHGl lApol. 

The present invention also provides a recombinant plasmid capable of replicating in a 

10 bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the preterminal protein gene, the deletion resulting in the inability to express functional 
preterminal protein without disruption of the VA RNA genes. The present invention is not 
limited by the specific deletion employed to render the pTP inactive: any deletion within the 
pTP coding region which does not disrupt the ability to express the Ad VA RNA genes may 

15 be employed. In a preferred embodiment, the deletion comprises a deletion of nucleotides 
10.705 to 1 1.134 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid 
has the designation pApTP. In another preferred embodiment, the recombinant plasmid has 
the designation pBHGl lApTP. 

In a preferred embodiment, the recombinant plasmid containing a deletion with the 

20 pTP region further comprises a deletion within the polymerase gene, this deletion resulting in 
reduced (preferably absent) polymerase activity. The present invention is not limited by the 
specific deletion employed to inactivate the polymerase and pTP genes. In a preferred 
embodiment, the deletion comprises a deletion of nucleotides 8.773 to 9586 and 1 1.067 to 
12.513 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid has the 

25 designation pAXBApolApTPVARNA+tl3. In another preferred embodiment, the recombinant 
plasmid has the designation pBHGl lApolApTPVARNA+tl3. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A is a schematic representation of the Ad polymerase expression plasmid 
30 pRSV-pol indicating that the Ad2 DNA polymerase sequences are under the transcriptional 
control of the RSV-LTR/promoter element and are Hanked on the 3' end by the SV-40 small 
t intron and SV-40 polyadenylation addition site. 



- 7 - 



WO 98/17783 PCT/US97/ 19541 

Figure 1 B is a schematic representation of the expression piasmid pRSV-pTP • 
indicating that the Ad2 preterminal protein sequences are under the transcriptional control of 
the RSV-LTR/promoter element and are Hanked on the 3' end by the SV-40 small-t intron 
and SV-40 polyadenylation signals. 

Figure 2 is an eihidium bromide-stained gel depicting the presence of Ad pol DNA 
sequences in genomic DNA from LP-293 cells and several hygromycin-resistant cell lines. 
The -750 bp PCR products are indicated by the arrow. 

Figure 3 is an autoradiograph depicting the results of a viral replication- 
complementation assay analyzing the functional activity of the Ad polymerase protein 
expressed by LP-293 cells and several hygromycin resistant cell lines. The 8.010 bp HindlU 
fragments analyzed by densitometry are indicated by an arrow. 

Figure 4 is an autoradiograph indicating that cell lines B-6 and C-7 contained a 
smaller and a larger species of Ad polymerase mRNA while LP-293 derived RNA had no 
detectable hybridization signal. The location of the two species of Ad polymerase mRNA are 
indicated relative to the 28S and I8S ribosomal RNAs. and the aberrant transcript expressed 
by the B-9 cell line is indicated by an arrow. 

Figure 5 is an autoradiograph showing which of the cell lines that received the 
preterminal protein expression piasmid indicated the presence of pRSV-pTP sequences (arrow 
labelled "pTP") and El sequences (arrow labelled "El"). 

Figure 6 is an autoradiograph indicating in which of the cell lines transcription of 
preterminal protein is occurring (arrow labelled 3kb M ). 

Figure 7 is an autoradiograph indicating that the expression of the Ad-polymerase 
could overcome the replication defect of H5sub\00 at non-permissive temperatures. 

Figure 8 graphically depicts plaque titre for LP-293, B-6 and C-7 cell lines infected 
with wtAd5. H5ts36. or H5.v/v£100. and the results demonstrate that the C-7 cell line can be 
used as a packaging cell line to allow the high level growth of El. preterminal, and 
polymerase deleted Ad vectors. 

Figure 9 is an autoradiograph showing that the recombinant pol" virus is viable on 
pol-expressing 293 cells but not on 293 cells which demonstrates that recombinant Ad viruses 
containing the 612 bp deletion found within pApol lack the ability to express Ad polymerase. 

Figure 10 is a schematic representation of the structure of pAdSpdys wherein the two 
inverted adenovirus origins of replication are represented by a left and right inverted terminal 
repeat (LITR/RITR). PI and P2 represent location of probes used for 'Southern blot analysis. 
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Figure I I graphically illustrates the total number of transducing adenovirus particles 
produced (output) per serial passage on 293 cells, total input virus of either the helper (hpAP) 
or Adipdys. and the total number of cells used in each infection. 

Figures I2A-B show Southern blot analyses of viral DNA from lysates 3. 6. 9 and 12. 
digested with the restriction enzymes fl^HII. Nrul and EcoRV. For the analyses, fragments 
from the C terminus of mux musculus dystrophin cDNA (A) or the N terminus of E coli p- 
galactosidase (B) were labeled with dCTP ; - and used as probes. 

Figures I3A-B show the physical separation of AdSPdys from hpAP virions at the 
third (A) and final (B) stages of CsCl purification. 

Figure 14 graphically depicts the level of contamination of AdSpdys EAMs by hpAP 
virions obtained from the final stage of CsCl purification as measured by P-galactosidase and 
alkaline phosphatase expression. The ratio of the two types of virions - AdSpdys EAMs 
(LacZ) or hpAP (AP) in each fraction is indicated in the lower graph. 

Figures I5A-B are western blot (immunoblot) analyses of protein extracts from mdx 
myoblasts and myotubes demonstrating the expression of P-galactosidase (A) and dystrophin 
(B) in cells infected with AdSpdys EAMs. 

Figures 16A-C depict immunofluorescence of dystrophin expression in wild type 
MM 14 myotubes (A), uninfected mdx (B) and infected mdx myotubes (C). 

Figure 17 is a schematic representation of the MCfC/lacZ constructs tested to determine 
what portion of the - 3.3 kb DNA fragment containing the enhancer/promoter of the MCK 
gene is capable of directing high levels of expression of linked genes in muscle cells. 

Figure 18 is a schematic representation of a GFP/p-gal reporter construct suitable for 
assaying the expression of Crc recombinase in mammalian cells. 

Figure 19 is a schematic representation of the recombination event between the loxP 
shuttle vector and the Ad5 J/7001 genome. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined below. 

The term "gene" refers to a DNA sequence that comprises control and coding 
sequences necessary for the production of a polypeptide or precursor thereof. The polypeptide 
can be encoded by a full length coding sequence or by any portion of the coding sequence so 
long as the desired enzymatic activity is retained. The term "gene" encompasses both cDNA 
and genomic forms of a uiven eene. 
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The term "wild-type" refers to a gene or gene product which has the characteristics of 
that gene or gene product when isolated from a naturally occurring source. A wild-type uene 
is that which is most frequently observed in a population and is thus arbitrarily designated the 
"normal" or "wild-type" form of the gene. In contrast, the term "modified" or "mutant" refers 
to a gene or gene product which displays modifications in sequence and or functional 
properties (i.e.. altered characteristics) when compared to the wild-type gene or gene product. 
It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that 
they have altered characteristics when compared to the wild-type gene or gene product. 

The term "oligonucleotide" as used herein is defined as a molecule comprised of two 
or more deoxy ribonucleotides or ribonucleotides, usually more than three (3). and typically 
more than ten (10) and up to one hundred (100) or more (although preferably between twenty 
and thirty). The exact size will depend on many factors, which in turn depends on the 
ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in any 
manner, including chemical synthesis. DNA replication, reverse transcription, or a 
combination thereof. 

As used herein, the term "regulatory element" refers to a genetic element which 
controls some aspect of the expression of nucleic acid sequences. For example, a promoter is 
a regulatory element which facilitates the initiation of transcription of an operably linked 
coding region. Other regulatory elements are splicing signals, polyadenylation signals, 
termination signals, etc. (defined infra). 

Transcriptional control signals in eucaryotes comprise "promoter" and "enhancer" 
elements. Promoters and enhancers consist of short arrays of DNA sequences that interact 
specifically with cellular proteins involved in transcription [Maniatis. T. et aL. Science 
236:1237 (1987)]. Promoter and enhancer elements have been isolated from a variety of 
eukaryotic sources including genes in yeast, insect and mammalian cells and viruses 
(analogous control elements, i.e.. promoters, are also found in procaryotes). The selection of 
a particular promoter and enhancer depends on what cell type is to be used to express the 
protein of interest. Some eukaryotic promoters and enhancers have a broad host range while 
others are functional in a limited subset of cell types [for review see Voss. S.D. et aL. Trends 
Biochem. Sci.. 1 1:287 (1986) and Maniatis. T. et aL. supra ( 1987)). 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired coding sequence and appropriate DNA sequences necessary for the 
expression of the operably linked coding sequence in a particular host organism (e.g. 
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mammal). DNA sequences necessary tbr expression in procaryotes include a promoter., 
optionally an operator sequence, a ribosome binding site and possibly other sequences. 
Eukaryotic cells are known to utilize promoters, polyadenlyation signals and enhancers. 

The terms "in operable combination", "in operable order" and "operably linked" as 
used herein refer to the linkage of nucleic acid sequences in such a manner that a nucleic acid 
.niolecule capable of directing the transcription of a given gene and/or the synthesis of a 
desired protein molecule is produced. The term also refers to the linkage of amino acid 
sequences in such a manner so that a functional protein is produced. 

The term "genetic cassette" as used herein refers to a fragment or segment of DNA 
containing a particular grouping of genetic elements. The cassette can be removed and 
inserted into a vector or plasmid as a single unit. A plasmid backbone refers to a piece of 
DNA containing at least plasmid origin of replication and a selectable marker gene (e.g.. an 
antibiotic resistance gene) which allows for selection of bacterial hosts containing the plasmid: 
the plasmid backbone may also include a polylinker region to facilitate the insertion of 
genetic elements within the plasmid. When a particular plasmid is modified to' contain non- 
plasmid elements (e.g.. insertion of Ad sequences and/or a eukaryotic gene of interest linked 
to a promoter element), the plasmid sequences are referred to as the plasmid backbone. 

Because mononucleotides are reacted to make oligonucleotides in a manner such that 
the 5' phosphate of one mononucleotide pentose ring is attached to the 3" cxyeen of its 
20 neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the M 5' end" if its 5' phosphate is not linked to the 3" oxygen of a 
mononucleotide pentose ring and as the "3" end" if its 3" oxygen is not linked to a 5" 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5* and 3" ends. 
- 5 Uhen two different, non-overlapping oligonucleotides anneal to different regions of 

the same linear complementary nucleic acid sequence, and the 3' end of one oligonucleotide 
points towards the 5" end of the other, the former may be called the "upstream" 
.oligonucleotide and the latter the "downstream" oligonucleotide. 

The term "primer" refers to an oligonucleotide which is capable of acting as a point of 
30 initiation of synthesis when placed under conditions in which primer extension is initiated. 

An oligonucleotide "primer" may occur naturally, as in a purified restriction digest or may be 
produced synthetically. 
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A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize with a 
template strand for primer elongation to occur. A primer sequence need not reflect the exact 
sequence of the template. For example, a non-complementary nucleotide fragment may be 
attached to the 5' end of the primer, with the remainder of the primer sequence being 
substantially complementary to the strand. Non-complementary bases or Ionizer sequences can 
be interspersed into the primer, provided that the primer sequence has sufficient 
complementarity with the sequence of the template to hybridize and thereby form a template 
primer complex for synthesis of the extension product of the primer. 

"Hybridization" methods involve the annealing of a complementary sequence to the 
target nucleic acid (the sequence to be detected). The ability of two polymers of nucleic acid 
containing complementary sequences to find each other and anneal through base pairing 
interaction is a well-recognized phenomenon. The initial observations of the "hybridization" 
process by Marmur and Lane. Proc. Natl. Acad Sci. USA 46:453 (1960) and Doty el a/.. 
Proc. \ : atl. Acad. Sci. USA 46:461 (1960) have been followed by the refinement of this 
process into an essential tool of modern biology. 

The complement of a nucleic acid sequence as used herein refers to an oligonucleotide 
which, when aligned with the nucleic acid sequence such that the 5* end of one sequence is 
paired with the 3* end of the other, is in "antiparallel association." Certain bases not 
commonly found in natural nucleic acids may be included in the nucleic acids of the present 
invention and include, for example, inosine and 7-deazaguanine. Complementarity need not 
be perfect: stable duplexes may contain mismatched base pairs or unmatched bases. Those 
skilled in the art of nucleic acid technology can determine duplex stability empirically 
considering a number of variables including, for example, the length of the oligonucleotide, 
base composition and sequence of the oligonucleotide, ionic strength and incidence of 
mismatched base pairs. 

Stability of a nucleic acid duplex is measured by the melting temperature, or "T m . M 
The T, n of a particular nucleic acid duplex under specified conditions is the temperature at 
which on average half of the base pairs have disassociated. The equation for calculating the 
T m of nucleic acids is well known in the art. 

The term "probe" as used herein refers to a labeled oligonucleotide which forms a 
duplex structure with a sequence in another nucleic acid, due to complementarity of at least 
one sequence in the probe with a sequence in the other nucleic acid. 
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The lerm "label" as used herein refers to any atom or molecule which can be used to 
provide a detectable (preferably quantifiable) signal, and which can be attached to a nucleic 
acid or protein. Labels may provide signals detectable by fluorescence, radioactivity, 
colorimetry. gravimetry. X-ray diffraction or absorption, magnetism, enzymatic activity, and 
the like. 

The terms "nucleic acid substrate" and nucleic acid template" are used herein 
interchangeably and refer to a nucleic acid molecule which may comprise single- or double- 
stranded DNA or RNA. 

"Oligonucleotide primers matching or complementary to a gene sequence" refers to 
oligonucleotide primers capable of facilitating the template-dependent synthesis of single or 
double-stranded nucleic acids. Oligonucleotide primers matching or complementary to a gene 
sequence may be used in PCRs. RT-PCRs and the like. 

A "consensus gene sequence" refers to a gene sequence which is derived by 
comparison of two or more gene sequences and which describes the nucleotides most often 
present in a given segment of the genes: the consensus sequence is the canonical sequence. 

The term "polymorphic locus" is a locus present in a population which shows variation 
between members of the population [i.e.. the most common allele has a frequency of less than 
0.95). In contrast, a "monomorphic locus" is a genetic locus at little or no variations seen 
between members of the population (generally taken to be a locus at which the most common 
allele exceeds a frequency of 0.95 in the gene pool of the population). 

The term "microorganism" as used herein means an organism too small to be observed 
with the unaided eye and includes, but is not limited to bacteria, viruses, protozoans, fungi, 
and ciliates. 

The term "microbial gene sequences" refers to gene sequences derived from a 
microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
archaebacterial species. 

The term "virus" refers to obligate, uitramicroscopic. intracellular parasites incapable 
of autonomous replication {i.e.. replication requires the use of the host cell's machinery). 
Adenoviruses, as noted above, are double-stranded DNA viruses. The left and right inverted 
terminal repeats (ITRs) are short elements located at the 5" and 3" termini of the linear Ad 
genome, respectively and are required for replication of the viral DNA. The left ITR is 
located between 1-130 bp in the Ad genome (also referred to as 0-0.5 mu). The right ITR is 
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located from -3.7500 bp to the end of the genome (also referred to as 99. 5-1 00 mu). The two 

ITRs are inverted repeats of each other. For clarity, the left ITR or 5* end is sued define the 
5* and 3' ends of the ITRs. The 5* end of the left ITR is located at the extreme 5* end of the 
linear adenoviral genome: picturing the left ITR (LITR) as an arrow* extending from the 5* 
end of the genome, the tail of the 5* ITR is located at mu 0 and the head of the left ITR is 
located at - 0.5 mu (further the tail of the left ITR is referred to as the 5* end of the left ITR 
and the head of the left ITR is referred to as the 3* end of the left ITR). The tail of the right 
or 3" ITR is located at mu 100 and the head of the rigt ITR is located at - mu 99.5: the head 
of the right ITR is referred to as the 5" end of the right ITR and the tail of the right ITR is 
referred to as the 3* end of the right ITR (RJTR). In the linear Ad genome, the ITRs face 
each other with the head of each ITR pointing inward toward the bulk of the genome. When 
arranged in a "tail to tail orientation" the tails of each ITR (which comprise the 5* end of the 
LITR and the 3' end of the RITR) are located in proximity to one another white the heads of 
each ITR are separated and face outward (see for example, the arrangement of the ITRs in the 
EAM shown in Figure 10 herein). The "adenovirus packaging sequence" refers to the ¥ 
sequence which comprises five (AI-AV) packaging signals and is required for encapsidation 
of the mature linear genome: the packaging signals are located from -194 to 358 bp in the Ad 
genome (about 0,5-1.0 mu). 

The phrase "at least one adenovirus gene coding region" refers to a nucleotide 
sequence containing more than one adenovirus gene coding gene. A "helper adenovirus" or 
"helper virus" refers to an adenovirus which is replication-competent in a particular host cell 
(the host may provide Ad gene products such as El proteins), this replication-competent virus 
is used to supply in trans functions (e.g.. proteins) which are lacking in a second replication- 
incompetent virus: the first replication-competent virus is said to "help" the second 
replication-incompetent virus thereby permitting the propagation of the second viral genome 
in the cell containing the helper and second viruses. 

The term "containing a deletion within the E2b region" refers to a deletion of at least 
one basepair (preferably more than one bp and preferably at least 100 and most preferably 
more than 300 bp) within the E2b region of the adenovirus genome. An E2b deletion is a 
deletion that prevents expression of at least one E2b gene product and encompasses deletions 
within exons encoding portions of E2b-specific proteins as well as deletions within promoter 
and leader sequences. 
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assays), fluorescent, radioactive, and luminescent systems, in one embodiment, the present 

invention contemplates the £ coli P-galactosidase gene (available from Pharmacia Biotech. 
Pistacataway. NJ). green Fluorescent protein (GFP) (commercially available from Clontech. 
Palo Alto. CA). the human placental alkaline phosphatase gene, the chloramphenicol 
5 acetyltransferase (CAT) gene: other reporter genes are known to the art and may be 
employed. 

As used herein, the terms "nucleic acid molecule encoding" "DNA sequence 
encoding." and "DNA encoding" refer to the order or sequence of deoxyribonucleotides along 
a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the 
10 order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes 
for the amino acid sequence. 

DESCRIPTION OF THE INVENTION 

The present invention provides improved adenovirus vectors for the delivery of 

15 recombinant genes to cells in vitro and in vivo. As noted above, the present invention 

contemplates two approaches to improving adenovirus vectors. The first approach generally 
contemplates a recombinant plasmid containing the minimal region of the Ad genome 
required for replication and packaging (i.e.. the left and right ITR and the packaging or V P 
sequence) along with one or more genes of interest; this recombinant plasmid is packaged into 

20 an encapsidated adenovirus minichromosome (EAM) when grown in parallel with an El- 
deleted helper virus in a cell line expressing the El proteins (e.y.. 293 cells). The 
recombinant adenoviral minichromosome is preferentially packaged. To prevent the 
packaging of the helper virus, a helper virus containing loxP sequences Hanking the V V 
sequence is employed in conjunction with 293 cells expressing Cre recombinase: Cre-loxP 

25 mediated recombination removes the packaging sequence form the helper genome thereby 

preventing packaging of the helper during the production of EAMs. In the second approach, 
"damaged" or "deleted" adenoviruses containing deletions within the E2b region are 
employed. While the "damaged" adenovirus is capable of self-propagation in a packaging cell 
line expressing the appropriate E2b protein(s), the E2b-deleted recombinant adenovirus are 

30 incapable of replicating and expressing late viral gene products outside of the packaging cell 
line. 

In one embodiment, the self-propagating recombinant adenoviruses containing 
deletions in the E2b region of the adenovirus genome. In another embodiment, "gutted" 
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In order to further cripple viral protein expression, and also to decrease the frequency 

of generating RCA. the present invention provides Ad vectors containing deletions in the E2b 
region. Propagation of these E2b-deleted Ad vectors requires cell lines which express the 
deleted E2b gene products. The present invention provides such packaging cell lines and for 
the first lime demonstrates that the E2b gene products. DNA polymerase and preterminal 
protein, can be constitutively expressed in 293 cells along with the El gene products. With 
every gene that can be constitutively expressed in 293 cells comes the opportunity to generate 
new versions of Ad vectors deleted for the respective genes. This has immediate benefits; 
increased carrying capacity, since the combined coding sequences of the polymerase and 
preterminal proteins that can be theoretically deleted approaches 4.6 kb and a decreased 
incidence of RCA generation, since two or more independent recombination events would be 
required to generate RCA. Therefore, the novel El. Ad polymerase and preterminal protein 
expressing cell lines of the present invention enable the propagation of Ad vectors with a 
carrying capacity approaching 13 kb. without the need for a contaminating helper virus 
[Mitani et ai (1995) Proc. Natl. Acad. Sci. USA 92:3854]. In addition, when genes critical 
to the viral life cycle are deleted (e.g.. the E2b genes), a further crippling of Ad to replicate 
and express other viral gene proteins occurs. This decreases immune recognition of virally 
infected cells, and allows for extended durations of foreign gene expression. The most 
important attribute of El. polymerase, and preterminal protein deleted vectors, however, is 
their inability to express the respective proteins, as well as a predicted lack of expression of 
most of the viral structural proteins. For example, the major late promoter (MLP) of Ad is 
responsible for transcription of the late structural proteins LI through L5 [Doertler. In 
Adenovirus DNA. The Viral Genome and Its Expression (Martinus Nijhoff Publishing Boston. 
1986)]. Though the MLP is minimally active prior to Ad genome replication, the rest of the 
late genes get transcribed and translated from the MLP only after viral genome replication has 
occurred [Thomas and Mathews (1980) Cell 22:523]. This cis-dependent activation of late 
gene transcription is a feature of DNA viruses in general, such as in the growth of polyoma 
and SV-40. The polymerase and preterminal proteins are absolutely required for Ad 
replication (unlike the E4 or protein IX proteins) and thus their deletion is extremely 
detrimental to Ad vector late gene expression. 
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II. Packaging Cell Lines Constitutively Expressing E2b Gene 
Products 

The present invention addresses the limitations of current or first generation Ad 
vectors by isolating novel 293 cell lines coexpressing critical viral gene functions. The 
5 present invention describes the isolation and characterization of 293 cell lines capable of 
constitutively expressing the Ad polymerase protein. In addition, the present invention 
describes the isolation of 293 cells which not only express the EI and polymerase proteins." 
but also the Ad-preterminal protein. The isolation of cell lines coexpressing the El, Ad 
polymerase and preterminal proteins demonstrates that three genes critical to the life cycle of 

10 Ad can be constitutively coexpressed. without toxicity. 

In order to delete critical genes from self-propagating Ad vectors, the proteins encoded 
. by the targeted genes have to first be coexpressed in 293 cells along with the El proteins. 
Therefore, only those proteins which are non-toxic when coexpressed constitutively (or toxic 
proteins inducibly-expressed) can be utilized. Coexpression in 293 cells of the El and E4 

15 genes has been demonstrated (utilizing inducible, not constitutive, promoters) [Yeh at ai 

(1996) J. Virol. 70:559; Wang ct ai (1995) Gene Therapy 2:775; and Gorziglia er ai (1996) 
J. Virol. 70:4173]. The El and protein IX genes (a virion structural protein) have been 
coexpressed [Caravokyri and Leppard (1995) J. Virol. 69:6627]. and coexpression of the El. 
E4. and protein IX genes has also been described [Krougliak and Graham (1995) Hum. Gene 

20 Ther. 6:1575]. 

The present invention provides for the first time, cell lines coexpressing El and E2b 
gene products. The E2b region encodes the viral replication proteins which are absolutely 
required for Ad genome replication [Doerfler. supra and Pronk et ai (1992) Chromosoma 
I02:S39-S45], The present invention provides 293 cells which constitutively express the 140 

25 kD Ad-polymerase. While other researchers have reported the isolation of 293 cells which 
express the Ad-preterminal protein utilizing an inducible promoter [Schaack et ai (1995) J. 
Virol. 69:4079]. the present invention is the first to demonstrate the high-level, constitutive 
coexpression of the El. polymerase, and preterminal proteins in 293 cells, without toxicity. 
These novel cell lines permit the propagation of novel Ad vectors deleted for the El, 

30 polymerase, and preterminal proteins. 
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III. Encapsidated Adenoviral Minichromosomes 

The present invention also provides encapsidated adenovirus minichromosome (EAM) 
consisting of an infectious encapsidated linear genome containing Ad origins of replication, 
packaging signal elements, a reporter gene (e.g.. a p-galactosidase reporter gene cassette) and 
a gene of interest {e.g.. a full length (14 kb) dystrophin cDNA regulated by a muscle specific 
enhancer/promoter). EAMs are generated by cotransfecting 293 cells with supercoiled 
plasmid DNA (e.g.. pAd5(Jdys) containing an embedded inverted origin of replication (and the 
remaining above elements) together with linear DNA from E l -deleted virions expressing 
human placental alkaline phosphatase (hpAP) (a helper virus). All proteins necessary for the 
generation of EAMs are provided in trans from the hpAP virions and the two can be 
separated from each other on equilibrium CsCI gradients. These EAMs are useful for gene 
transfer to a variety of cell types both in vitro and in vivo. 

Adenovirus-mediated gene transfer to muscle is a promising technology for gene 
therapy of Duchenne muscular dystrophy (DMD). However, currently available recombinant 
adenovirus vectors have several limitations, including a limited cloning capacity of -8.5 kb. 
and the induction of a host immune response that leads to transient gene expression of 3 to 4 
weeks in immunocompetent animals. Gene therapy for DMD could benefit from the 
development of adenoviral vectors with an increased cloning capacity to accommodate a full 
length (-14 kb) dystrophin cDNA. This increased capacity should also accommodate gene 
regulatory elements to achieve expression of transduced genes in a tissue-specific manner. 
Additional vector modifications that eliminate adenoviral genes, expression of which is 
associated with development of a host immune response, might greatly increase long term 
expression of virally delivered genes in vivo. The constructed encapsidated adenovirus 
minichromosomes of the present invention are capable of delivering up to 35 kb of non-viral 
exogenous DNA. These minichromosomes are derived from bacterial plasmids containing 
two fused inverted adenovirus origins of replication embedded in a circular genome, the 
adenovirus packaging signals, a Pgalactosidase reporter gene and a full length dystrophin 
cDNA regulated by a muscle specific enhancer/promoter. The encapsidated minichromosomes 
are propagated in vitro by fram-comptementation with a replication defective (E1+E3 deleted) 
helper virus. These minichromosomes can be propagated to high titer (>10Vml) and purified 
on CsCI gradients due to their buoyancy difference relative to helper virus. These vectors are 
able to transduce myogenic cell cultures and express dystrophin in myotubes. These results 
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demonstrate that encapsidated adenovirus minichromosomes are useful tor gene transfer to 
muscle and other tissues. 

The present invention further provides methods for modifying the above-described 
EAM svstem to enable the generation of high titer stocks of EAMs with minimal helper virus 
5 contamination. Preferably the EAM stocks contain helper virus representing less than 1%. 
.preferably less than 0.1% and most preferably less than 0.01% (including 0.0%) of the final 
viral isolate. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The first is by selectively controlling the relative packaging efficiency of the helper virus 
10 versus ihe EAM virus. The Cre-loxP excision method is employed to remove the packaging 
.signals from the helper virus thereby preventing the packaging of the helper virus used to 
provide in trans viral proteins for the encapsidation of the recombinant adenovirus 
minichromosomes. The second approach to reducing or eliminating helper virus in EAM 
stocks is the use of improve physical methods for separating EAM from helper virus. 

15 

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations apply: M 

20 (molar): mM (miliimolar): uM (micromolar): mol (moles); mmol (millimoles): umol 
(micromoles): nmol (nanomoles): mu or m.u. (map unit): g (gravity): gm (grams): me 
(milligrams): pg (micrograms): pg (picograms): L (liters): ml (milliliters): pi (microliters): cm 
(centimeters): mm (millimeters): iim (micrometers): run (nanometers): hr (hour): min 
.(minute): msec (millisecond): *C (degrees Centigrade): AMP (adenosine 5 -monophosphate): 

25 :; cDNA i copy or complimentary DNA): DTT (dithiotheritol): ddH : 0 (double distilled water); 
dNTP (deoxyribonucleotide triphosphate); rNTP (ribonucleotide triphosphate): ddNTP 
(dideoxyribonucleotide triphosphate): bp (base pair): kb (kilo base, pair): TLC (thin layer 
chromatography): tRNA (transfer RNA); nt (nucleotide): VRC (vanadyl ribonucleoside 
complex): RNase (ribonuclease): DNase (deoxyribonuclease); poly A (polyriboadenylic acid); 

30 PBS (phosphate buffered saline); OD (optical density): HEPES (N-[2- 

HydroN>ethyl]piperazine-N-[2-ethanesulfonic acid]): HBS (HEPES buffered saline); SDS 
(sodium dodecyl sulfate): Tris-HCI (tris[Hydroxymethyl]aminomethane-hydrochloride): rpm 
(revolutions per minute): ligation buffer (50 mM Tris-HCI. 10 mM MgCU. 10 mM 
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dithiothreiiol. 25 ng/ml bovine serum albumin, and 26 uM NAD*, and pH 7.8): EGTA 
(ethylene glycoI-bis(p-aminoethyl ether) N. N. N'. N'-tetraacetic acid): EDTA 
(ethylenediaminetetracetic acid): ELISA (enzyme linked immunosorbant assay): LB (Luria- 
Bertani broth: 10 » tryptone. 5 g yeast extract, and 10 g NaCI per liter. pH adjusted to 7.5 
5 with IN NaOH): superbroth (12 g tryptone. 24 g yeast extract. 5 g glycerol. 3.8 g KH.PO, 

and 12.5 g. K : HP0 4 per liter): DMEM (Duibeccos modified Eagle's medium): ABI (Applied 
Biosystems Inc.. Foster City. CA): Amersham (Amersham Corporation. Arlington Heiehts. 
ID: ATCC (American Type Culture Collection. Rockville. MY): Beckman (Beckman 
Instruments Inc.. Fullerton CA): BM (Boehringer Mannheim Biochemicals. Indianapolis. IN): 

10 Bio- 101 (Bio- 101. Vista. CA): BioRad (BioRad. Richmond. CA): Brinkmann (Brinkmann 

Instruments Inc. Wesbury. NY): BRL. Gibco BRL and Life Technologies (Bethesda Research 
Laboratories. Lite Technologies Inc.. Gaithersburg. MD): CRI (Collaborative Research Inc. 
Bedford. MA): Eastman Kodak (Eastman Kodak Co.. Rochester. NY): Eppendorf (Eppendorf 
Eppendorf North America. Inc.. Madison, WI): Falcon (Becton Dickenson Labware. Lincoln 

15 Park. NJ): IBI (International Biotechnologies. Inc., New Haven. CT): ICN (ICN Biomedicals. 
Inc.. Costa Mesa. CA): Invitrogen (Invitrogen. San Diego. CA): New Brunswick (New 
Brunswick Scientific Co. Inc.. Edison. NJ): NEB (New England BioLabs Inc.. Beverly. MA): 
NEN (Du Pont NEN Products. Boston. MA): Pharmacia (Pharmacia LKB Gaithersburg, MD): 
Promega (Promega Corporation. Madison. WI): Stratagene (Stratagene Cloning Systems. La 

20 Jolla. CA): UVP (UVP. Inc.. San Gabreil. CA); USB (United States Biochemical Corp.. 
Cleveland. OH): and Whatman (Whatman Lab. Products Inc. Clifton. NJ). 

Unless otherwise indicated, all restriction enzymes and DNA modifying enzymes were 
obtained from New England Biolabs (NEB) and used according to the manufacturers 
directions. 

25 

EXAMPLE 1 

Generation Of Packaging Cell Lines That Coexpress 
The Adenovirus El And DNA Polymerase Proteins 

30 In this example, packaging cell lines coexpressing Ad El and polymerase proteins 

were described. These cell lines were shown to support the replication and growth of H5ts36, 
an Ad with a temperature-sensitive mutation of the Ad polymerase protein. These 
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polymerase-expressing packaging cell lines can 
E 1 and polymerase functions. 
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be used to prepare Ad vectors deleted for the 



a) Tissue Culture And Virus Growth 

LP-293 cells (Microbix Biosystems. Toronto) were grown and serially passaged as 
suggested by the supplier. 

Plaque assays were performed in 60 mm dishes containing cell monolayers at - 90% 
conlluency. The appropriate virus dilution in a 2% DMEM solution was dripped onto the 
cells, and the plates incubated at the appropriate temperature for one hour. The virus 
containing media was aspirated, the monolayer was overlaid with 10 mis of a pre-warmed 
EiylEM agar overlay solution (0.8% Noble agar. 4% fetal calf serum, and antibiotics) and 
allowed to solidify. After the appropriate incubation lime (usually 7 days for incubations at 
38.5°C and 10-12 days for incubations at 32°C). five mis of the agar-containing solution 
containing 1.3% neutral red was overlaid onto the infected dishes and plaques were counted 
the next day. An aliquot of the virus H5ts36 [Freimuth and Ginsberg (1986) Proc. Natl. 
Acad. Sci. USA 83:7816] was utilized to produce high litre stocks after infection of 293 cells 
at 32°C. H5ts36 is an Ad5-derived virus defective for viral replication at. the nonpermissive 
temperature [Miller and Williams (1987) J. Virol. 61:3630]. 

The infected cells were harvested after the onset of extensive cytopathic effect, 
pelleted by centrifugation and resuspended in 10 mM Tris-CI. pH 8.0. The lysate was freeze- 
thawed three times and centrifuged to remove the cell debris. The cleared lysate was applied 
to CsCK step gradients (heavy CsCl at density of 1.45 g/ml. the light CsCI at density of] .20 
g/ml). ultracentrifuged. and purified using standard techniques [Graham and Prevec (1991) In 
Methods in Molecular Biology, Vol 7: Gene Transfer and Expression Protocols. Murray 
(ed.). Humana Press. Clifton. NJ. pp. 109-128]. The concentration of plaque forming units 
(pfu) of this stock was determined at 32°C as described above. Virion DNA was extracted 
from the high titre stock by pronase digestion, phenol-chloroform extraction! and ethanol 
precipitation. The leakiness of this stock was found to be <l in 2000 pfu at the non- 
permissive temperature, consistent with previous reports [Miller and Williams (1987). supra]. 



b) Plasmids 

The expression plasmid pRSV-pol [Zhao and Padmanabhan (1988) Cell 55:1005] 
contains sequences encoding the Ad2 polymerase mRNA (including the start codon from the 



WO 98/17783 PCT/US97/ 19541 

exon at map unit 59) under the transcriptional control of the Rous Sarcoma Virus 
LTR/promoter element: the Ad2 DNA polymerase sequences are Hanked on the 3* end by the 
SV-40 small t intron and SV-40 polyadenylation addition site (see Fig. I A). Fiuure I A 
provides a schematic representation of the Ad polymerase expression plasmid pRSV-pol. 
5 pRSV-pol includes the initiator methionine and amino-terminal peptides encoded bv the exon 
at m. u. 39 of the Ad genome. The location of the PCR primers p602a and p2158c. the two 
Seal 1 kb probes utilized for Northern analyses, the polymerase terminator codon. and the 
polyadenylation site of the IVa2 gene (at 1 1.2 m.u. of the Ad genome) are indicated. 

The expression plasmid pRSV-pTP [Zhao and Padmanabhan (1988). supra] contains 

10 sequences encoding the Ad2 preterminal protein (including the amino terminal peptides 

encoded by the exon at map unit 39 of the Ad genome) under the transcriptional control of 
the Rous Sarcoma Virus LTR/promoter element; the pTP sequences are flanked on the 3' end 
by the SV-40 small-t intron and SV-40 polyadenylation signals (see Figure IB for a schematic 
of pRSV-PTP). Figure IB also shows the location of the EcoRV subfragment utilized as a 

15 probe in the genomic DNA and cellular RNA evaluations, as well as the initiator methionine 
codon present from map unit 39 in the Ad5 genome. The locations of the H5/>7l90 and 
H5.S7//?100 insertions are shown relative to the preterminal protein open-reading frame. The 
following abbreviations are used in Figure 1 : ORF. open reading frame: small-t. small tumor 
antigen, and m.u.. map unit. 

20 pCEP4 is a plasmid containing a hygromycin expression cassette (Invitrogen). 

pFG140 (Microbix Biosystems Inc. Toronto. Ontario) is a plasmid containing 
sequences derived from Ad5c//309 which contain a deletion/substitution in the E3 region 
[Jones and Shek (1979) Cell 17:683). pFGMO is infectious in single transfections of 293 
cells and is used as a control for transfection efficiency. 

25 

c) Transfection Of 293 Cells 

LP-293 cells were cotransfected with BamHX linearized pRSV-pol and BamHl 
linearized pCEP4 at a moliar ratio of 10:1 using a standard CaP0 4 precipitation method 
[Sambrook cf al. (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor 
30 Press. Plainview. NY. pp. 16.33-1 6.36]. In addition. 293 cells were cotransfected with or with 
DamHX linearized pRSV-pol. BamHl linearized pRSV-pTP and pCEP using a molar ratio of 
10:1 (non-selectable:selectable plasmids). 
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Forty-eight hours alter transfection. the cells were passaged into media containing 
hygromycin at 100 ug/mL. Individual hygromycin resistant colonies were isolated and 
expanded. 



d) Isolation Of Ad Polymerase Expressing 293 Cell Lines 

Twenty hygromycin resistant cell lines were expanded and screened for the ability to 
express the Ad polymerase protein. Initially, the individual cell lines were assayed for the 
ability to support growth of the viral polymerase mutant H5ts36 using the plaque assay 
described in section a) above. It was speculated that constitutive expression of the wild type 
Ad polymerase protein in a clonal population of 293 ceils should allow the growth of H5ts36 
at 38.5°C. However, it was unclear if constitutive expression of the Ad polymerase would be 
toxic when coexpressed with the El proteins present in 293 cells. Similar toxicity problems 
have been observed with the Ad ssDBP. and the pTP [Klessig ci at. (1984) Mol. Cell. Biol. 
4:1354 and Schaack et al. (1995) J. Virol. 69:4079]. 

Of the twenty hygromycin resistant cell lines isolated, seven were able to support 
plaque formation with H5ts36 at the non-permissive temperature, unlike the parental LP-293 
cells: these cell lines were named B-6. B-9. C-l. C-4. C-7..C-I3 and C-14 (see Table 1) (the 
B-6 and B-9 cell lines received only the pRSV-pol and CEP4 plasmids: C-l. C-4. C-7. C-l 3 
and C-14 cell lines received the pRSV-pol. pRSV-pTP and CEP4 plasmids). For the results 
shown in Table I. dishes (60 mm) of near confluent cells of each cell line were infected with 
the same dilution of H5ts36 at the temperature indicated, overlaid with agar media, and 
stained for plaques as outlined in section a. Passage number refers to the number of serial 
passages after initial transfection with the plasmid pRSV-pol. 

The cell line B-6 produced plaques one day earlier than the other Ad polymerase- 
expressing cell lines, which may reflect increased polymerase expression (see below). Cell 
line B-9 demonstrated an increased doubling time whereas each of the other cell lines 
displayed no growth disadvantages relative to the parental LP-293 cells. As shown in Table 
1. even after multiple passages (in some instances up to four months of serial passaging) the 
cells were still capable of H5ts36 plaque formation at the non-permissive temperature, 
indicating that the RSV-LTR/promoter remained active for extended periods of time. 
However, the cell lines B-9 and C-l 3 displayed a decreased ability to plaque the virus at 32°C 
as well as at 38.5°C. suggesting that a global viral complementation defect had occurred in 
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these ceil lines after extended passaging. The remaining cell lines screened at later passaces 
demonstrated no such defect, even after 20 passages (e.g.. cell line B-6. Table IV 

TABLE I 



Plnquinc Ability Of H5ts36 At The Non- Permissive Temperature Utilizing 293-Ad Polymerase Expressing 

Cell Lines 



Cell Line 


Passage Number 


Number Of Plaques At: 




32.0°C 


38.5°C 


293 


- 


>500 


0 


B-6 


9 


>500 


>500 


20 


>500 


>500 


B-9 


S 


>500 


>500 


14 


90 


18 


c \ 


5 


-500 


500 


L • 1 


13 


-500 


-500 


C J 


5 


>500 


>500 


14 


>500 


-500 


C-7 


5 


>500 


-500 


14 


>500 


^500 




5 


>500 


>500 


C-13 


27 , 


120 


4 




5 


>500 


>500 


C-14 


27 


>500 


370 



e) Genomic Analysis Of Ad Polymerase-Expressing Cell Lines 

Genomic DNA from LP-293 cells and each of the seven cell lines able to complement 
H5ts36 at 38.5°C were analyzed by PCR for the presence of pRSV-pol derived sequences. 
Genomic DNA from LP-293 cells and the hygromycin resistant cell lines were harvested 
using standard protocols (Sambrook et aL. supra) and 200 ng of DNA from each cell line was 
analyzed by PCR in a solution containing 2 ng/mL of primers p602a and p2158c. 10 mM 
Tris HCl. pH 8.3. 50 mM KC1. 1.5 mM MgCk and 0.001% gelatin. The forward primer. 
P 602a [5 -TTCATTTTATGTTTCAGGTTC AGGG-3" (SEQ ID NO:2)] is located in the SV- 
40 polyadenylation sequence. The reverse primer p2158c (5*- 

TTACCGCCACACTCGCAGGG-3' (SEQ ID NO:3)] is Ad-sequence specific with the 5* 
nucleotide located at position 3394 of the Ad 5 genome [numbering according to Doerfler 
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The resulting autoradiograph is shown in Figure 3. In Figure 3. the lane marked 
"Std." (standard) contains I |jg of ///TidHI-digested H5ts36 virion DNA. The 8.010 bp 
Hindlll fragments analyzed by densitometry are indicated by an arrow. 

As shown in Figure 3. H5ts36 had a diminished ability to replicate in LP-293 cells at 
the non-permissive temperature. In contrast, all seven of the previously selected cell lines 
were able to support replication of H5ts36 virion DNA at 38.5°C to levels approaching those 
occurring in LP-293 cells at 32°C. 

A densitometry analysis of the amount of H5ts36 viral DNA replicated in each of the 
cell lines at permissive and nonpermissive temperatures is presented in Table 2 below. For 
this assay, the relative amounts of the 8.010 bp Hindlll fragment were compared. The 
relative levels of H5ts36 virion DNA replication determined by densitometric analysis of the 
8.010 bp Hindlll fragment isolated from each of the cell line DNA samples. The surface area 
of the 8.010 bp fragment in 293 cells incubated at 38,5°C was designated as 1. and includes 
some replicated H5ts36 virion DNA. The numbers in each column represent the ratio 
between the densities of the 8.010 bp fragment isolated in the indicated cell line and the 
density of the same band present in LP-293 cells at 38.5°C. 

As shown in Table 2, the levels of replication at the permissive temperature were all 
within four-fold of each other, regardless of which cell line was analyzed, but at the non- 
permissive temperature LP-293 cells reveal the H5ts36 replication defect. The viral bands 
that were present in the LP-293 DNA sample at 38. C represented input virion DNA as well as 
low level replication of H5ts36 DNA. which is generated due to the leakiness of the ts 
mutation at the high MOl utilized in this experiment. The Ad polymerase-expressing cell 
tines were all found to be capable of augmenting 
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location of the smaller and larger species of Ad polymerase mRNA are indicated relative to 
the 28S and 18S ribosomal RNAs. The aberrant transcript expressed by the B-9 cell line is 
indicated by an arrow. 

The results shown in Figure 4 revealed that the RNA derived from cell lines B-6 and 

5 C-7 contained two species of RNA. estimated to be ^4800 and -7000 nt in length, while LP-293 
derived RNA had no detectable hybridization signal. The presence of two polymerase RNA 
species suggests that the polyadenylation signal of the Ad IVa2 gene (present in the Ad-pol 
construct, see Fig. 1 A) is being utilized by the cell RNA processing machinery, in addition to 
the SV-40 polyadenylation signal. Similar analysis of RNA derived from the cell lines C-l. 

10 C-4. C-13. and C-14 also detected the same two transcripts as those detected in the RNA of 
cell lines B-6 and C-7. but at decreased levels, suggesting that even low levels of Ad 
polymerase mRNA expression can allow for the efficient replication of polymerase mutants 
such as H5ts36. The cell line B-6 expressed high levels of polymerase transcript and can 
plaque H5ts36 one day earlier than the other cell lines at 38.5°C. suggesting a causal 

15 relationship- It is interesting to note that the two polymerase transcripts are also detected in 
RNA isolated from cell line B-9, but substantial amounts of a larger RNA transcript (size > 
10 kb) is also present (see Figure 4). The high level production of the aberrant message may 
be related to the increased doubling time previously noted in this cell line. 

20 h) Transferability Of Ad Polymerase-Expressing Cell Lines 

The ability of Ad polymerase-expressing 293 cell lines to support production of 
H5ts36 virions after transfection with H5ts36 genome DNA was examined as follows. 293 
cells as well as hygromycin resistant cell lines were grown to near confluency on 60 mm 
dishes and transfected with either 3 |ig of purified H5ts36 virion DNA, or with 3.5 jig of the 
25 plasmid pFG140 (Microbix Biosystems), using the cationic lipid Lipofectamine (Gibco-BRL). 
Cells that received the H5ts36 virion DNA were incubated at 32°C for 14 days, or 38.5°C for 
10 days. The pFG140 transfected cells were incubated at 37.5°C for 10 days. All plates 
were then stained with the neutral red agar overlay and plaques were counted the next day. 
The results are shown in Table 3. 
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expressing packaging cell lines can be used to prepare Ad vectors deleted for the El and 
polymerase functions. In this example. 293 cells cotransfected with both Ad polymerase and 
preterminal protein expression plasmids are characterized. Cell lines co-expressin° the Ad El. 
polymerase and preterminal proteins can be used to prepare Ad vectors deleted for the EI. 
polymerase and preterminal protein (pTP) functions. 

a) Tissue Culture And Virus Propagation 

The use of LP-293 cells (Microbix Biosystems Inc., Toronto). Ad-polymerase 
expressing cell lines, and plaquing efficiency assays of Ad viruses was conducted as described 
in Example 1. All cells were maintained in 10% fetal bovine serum supplemented DMEM 
media (GIBCO) in the presence of antibiotics. The virus H5sh&100 [Freimuth and Ginsberg 
(1986) Proc. Natl. Acad. Sci. USA 83:7816] has a temperature sensitive (ts) mutation caused 
by a three base pair insertion within the amino terminus of the preterminal protein, in addition 
to a deletion of the El sequences (see Figure IB). H5sub\00 was propagated and titred at 
32.0°C in LP-293 cells: the leakiness of this stock was less than I per 1000 plaque-forming 
units (pfu) at the nonpermissive temperature of 38.5°C. A lower titer cell lysate contaning 
the virus H5/«190 (which contains a 12 base-pair insertion within the carboxy-terminus of the 
preterminal protein as well as a deletion of the El region, see Figure IB) was provided by Dr. 
P. Freimuth [Freimuth and Ginsberg (1986), supra). The polymerase and preterminal protein 
expressing cell lines were always maintained in media supplemented with hygromycin 
(Sigma) at 100 pg/mL. 

b) Isolation Of Ad Polymerase And Preterminal Protein 
Expressing 293 Cells 

The C-l. C-4, C-7. C-l 3, and C-l 4 cell lines (Ex. 1), which had been cotransfected 
with pRSV-poh pRSV-pTP and CEP4. were screened for presence of pTP sequences and for 
the ability to support the growth of H5ts36 (ts for the Ad-polymerase), H5iwI90. and 
H5A7//H00 using plaque assays as described in Example 1. 
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i) Analysis Of Genomic DNA And Cellular RNA 

Cell lines thai had received the preterminal protein expression plasmid were screen tor 
the presence of pRSV-pTP sequences and El sequences. Total DNA was isolated from the 
LP-293. B-6 (transfected with pRSV-pol only) or C-7 (cotransfected with pRSV-pol and 
pRSV-pTP) cell lines, two micrograms (ug) of each DNA was codigested with the restriction 
enzymes Xhal and Bc/mHI. electrophoretically separated in a 0.6% agarose cel. and 
transferred onto a nylon membrane. The membrane was UV crosslinked. probed with both a 
1.8 kb Bln\-Xha\ fragment (spans the El coding region) isolated from the plasmid pFGI40. 
and a I.Skb EcoRV ^fragment of Ad serotype 5 (spans the preterminal protein coding 
sequences: see Fig. IB), both of which were random-primer radiolabeled with i2 P to a specific 
activity greater than 3.0 x 10 s cpm/ug. The membrane was subsequently exposed to X-ray 
film with enhancement by a fluorescent screen. The resulting autoradiograph is shown in 
Figure 5. 

In Figure 5. the preterminal specific sequences migrated as an ^1 1.0 kb DNA fragment 
while the El containing band migrated as a 2.3 kb DNA fragment. No hybridization of either 
probe to DNA isolated from either the LP-293 or B-6 cell lines was observed As shown in 
Figure 5. only the C-7 cell line genomic DNA had preterminal coding sequences, unlike the 
parental LP-293 cells, or the Ad-polymerase expressing B-6 cells. In addition, all cell lines 
had El specific sequences present at nearly equivalent amounts, demonstrating that the 
selection design has not caused the loss of the EI sequences originally present in the LP-293 
cells. The results presented in Example I demonstrated that both the B-6 and C-7 cell lines 
contain polymerase specific sequences within their genomes, unlike the parental LP-293 cells. 

To confirm that transcription of preterminal protein was occurring, total RNA was 
isolated from each of the cell lines, transferred to nylon membranes, and probed to detect 
■ preterminal protein-specific mRNA transcripts as follows. Total cellular RNA was isolated 
from the respective cell lines and 15 ug of total RNA from each cell line was transferred to 
nylon membranes. The membranes were probed with the 1.8 kb EcoRV radiolabeled 
subfragment of Ad5 (see Fig. IB) complementary to the preterminal protein coding region. 
The resulting autoradiograph is shown in Figure 6. 

As shown in Figure 6. a single mRNA of the expected size (~3 kb in length) is detected 
only in RNA derived from the C-7 cell line. No hybridization was detected in lanes 
containing RNA derived from the LP-293 or B-6 cell lines. In Example 1. it was 
demonstrated that the C-7 cell line also expresses high levels of the Ad polymerase mRNA. 
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Thus, the C-7 cell line constitutively expresses both the Ad polymerase and preterminal 
protein mRNAs along with EI transcripts. 



ii) Plaquing Efficiency Of pTP Mutants On pTP- 
Expressing Cell Lines 

The C-7 cell line was screened for the ability to transcompiement the erowth of 
preterminal mutant viruses. The virus H5/«190 (contains a 12 base pair insertion located 
within the carboxv-terminus of the preterminal protein) has been shown to have a severe 
growth and replication defect, producing less than 10 plaque-forming units per cell [Freimuth 
and Ginsberg (1986) Proc. Natl. Acad. Sci. USA 83:7816], The results are summarized in 
Table 4 below. For the results shown in Table 4, LP-293. B-6. or C-7 cells were seeded at a 
density of 2.0-2.5 x 10 6 cells per plate. The cells were infected with limiting dilutions of 
lysates derived from the preterminal protein-mutant viruses H5//7I90 or M5a-«6IO0. incubated 
at 38.5°C. and plaques counted after six days. As shown in Table 4. only the C-7 cell line 
could allow efficient plaque formation of H5/nl90 at 38.5°C (the H5//?190 lysate used to 
infect the cells was of a low titer, relative to the high titer H5suh\00 stock), while both the B- 
6 and C-7 cell lines had nearly equivalent plaquing efficiencies when H5.V//M00 was utilized 
as the infecting virus. 

TABLE 4 



Plaquing Efficiency Of Preterminal Protein-Mutant Viruses 



Virus 


Mutation 
Location 


Plaque Titres (pfu/ml) 


LP-293 


B-6 


C-7 


H5//»I90 


carboxv-terminus 


<l x I0 : 


<i x 10 : 


1.4 x 10* 


H5sub\Q0 


amino-terminus 


<\ x 10" 


9.0 x 10* 


4.5 x 10" 



As shown in Table 4. when equivalent dilutions of H5/nl90 were utilized, the plaquing 
efficiency of the C-7 cell line was at least 100-fold greater than that of the B-6 or LP-293 
cells. This result demonstrated that the C-7 cell line produces a functional preterminal 
protein, capable of trans-complementing the defect of the H5ml90 derived preterminal 
protein. 

The cell lines were next screened for the ability to trans-complement with the 
temperature-sensitive virus. HSsublQQ. at nonpermissive temperatures. H5sub\00 has a codon 
insertion mutation within the amino-terminus of the preterminal protein, as well as an El 
deletion. The mutation is responsible both for a temperature sensitive growth defect, as well 
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as a replication defect [Freimuth and Ginsberg ( 1986). supra and Schaack et al. (1995) J. 
Virol. 69:4079]. The plaquing efficiency of the cell line C-7 was found to be at least 1000 
fold greater than that of the LP-293 cells (at non-permissive temperatures) (see Table 4). 
interestingly, the cell line B-6 was also capable of producing large numbers of \\5sub\W 
5 derived plaques at 38.5°C. even though it does not express any preterminal protein. This 
, result suggested that the high level expression of the polymerase protein was allowing plaque 
formation of H5i«6100 in the B-6 cell line. To examine this possibility, the nature of 
H5.vw/?I00 growth in the various cell lines was examined. 

10 .. c > Complementation Of The Replication And Growth Defects 

Of HSjmMOO 

The cell lines B-6 and C-7 were shown to overcome the replication defect of H5ts36 
(Ex. I). Since the preterminal and polymerase proteins are known to physically interact with 
each other [Zhao and Padmanabhan ( 1988) Cell 55:1005]. we investigated whether the 

1 5 expression of the Ad-polymerase could overcome the replication defect of H5.n/6I00 at non- 
permissive temperatures using the following replication-complementation assay. 

LP-293. B-6. or C-7 cells were seeded onto 60 mM dishes at a density of 2xI0 6 cells 
per dish and infected the next day with H5sub\0Q at a multiplicity of infection (MOI) of 0.25, 
and incubated at 38.5°C for 16 hours or 32.0°C for 40 hours. The cells from each infected 

20 plate were then harvested and total DNA extracted as described in Example I. Four 

micrograms of each DNA sample was digested with HindlU. electrophoresed through a 0.7% 
agarose gel. transferred to a nylon membrane, and probed with ;: P-labeled H5ts36 virion 
DNA. The resulting autoradiograph is shown in Figure 7. As seen in Figure 7. the 
J [. H5a-m*100 replication defect when grown in LP-293 cells at 38.5°C is seen: this defect is not 

25 present when the virus is grown at the same temperature in either B-6 or C-7 cells. 

The results depicted in Figure 7 demonstrates that both cell lines B-6 and C-7 could 
trans-complement the replication defect of HSsublOO. this result demonstrated that the 
expression of the Ad polymerase in B-6 cells was able to overcome the preterminal protein- 
mediated replication defect of H5iu/>100. While not limiting the present invention to any 

30 particular mechanism, the ability of Ad polymerase to overcome the preterminal protein- 
mediated replication defect of H5sub\00 may be due to a direct physical interaction of the 
polymerase with the amino-terminus of the H5.sw6100-derived preterminal protein. 
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In support of this hypothesis, it has also been demonstrated that the H5.H/6100 
replication defect can be overcome When LP-293 cells were infected with a 100-fold greater 
amount of HSsublOO. However, complementation of the HSsublOO replication defect is not 
sufficient to overcome the growth defect of HSsublOO. since temperature shift-up experiments 
5 have demonstrated that the HSsublOO growth defect is not dependent upon viral replication 
[Schaack et aL (1995). supra]. Therefore, the overexpression of the Ad-polymerase. must 
have allowed a very low level but detectable production of infectious H5.vmAI00 particles in 
the B-6 cell line. The reduced growth of H5.sz/Z>100 is therefore not due to a replication 
defect, but rather some other critical activity that the preterminal protein has a role in. such as 

10 augmention of viral transcription by association with the nuclear matrix [Schaack and Shenk 
(1989) Curr. Top. Microbiol. Immunol. 144:185 and Hauser and Chamberlain (1996) J. Endo. 
149:373]. This was confirmed by assessing the ability of the C-7 cell line to overcome the 
growth defect of HSsublOO utilizing one-step growth assays performed as follows. 

Each of the cell lines (LP-293. B-6 and C-7) were seeded onto 60 mm dishes at 2.0 x 

15 10* cells/dish. The cell lines were infected at an MOI of 4 with each of the appropriate 
viruses (wiAd5. H5ts36, or HS^/AIOO). and incubated at 38.5°C for 40 hours. The total 
amount of infectious virions produced in each 60 mm dish was released from the cell lysates 
by three cycles of freeze-thawing. and the titer was then determined by limiting dilution and 
plaque assay on B-6 cells at 38.5°C. The results are summarized in Figure 8. 

-0 The results shown in Figure 8. demonstrated that even though the B-6 cell line allowed 

normal replication and plaque formation of HSsublOO at 38.5°C (in fact. B-6 cells were 
utilized to determine the plaque titres depicted in Figure 8) they could not allow high level 
growth of H5.vmA100 and only produced titres of HSsublOO equivalent to that produced by the 
LP-293 cells. The C-7 cell line produced 100 fold more virus than the LP-293 or B-6 cells, 

25 see Figure 8. Encouragingly, the titre of HSsublOO produced by the C-7 cells approached 
titres produced by LP-293 cells infected with wild-type virus. Ad5. When the HSsublOO 
virions produced from infection of the C-7 cells were used to infect LP-293 cells at 38.5°C 
all virus produced retained the rs mutation (i.e.. at least a 1000 fold drop in pfu was detected 
when LP-293 cells were respectively infected at 38.5°C vs. 32.0°C). This finding effectively 

30 rules out the theoretical possibility that the H5swM00 input virus genomes recombined with 
the preterminal protein sequences present in the C-7 cells. 

In addition, the C-7 cell line allowed the high level growth of H5ts36. demonstrating 
that adequate amounts of the Ad-polymerase protein were also being expressed. The C-7 cell 
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line was capable of irans-complememing the growth of both H5ts36 and H5sub\00 after 4 
months of serial passaging, demonstrating that the compression of the El. preterminal, and 
polymerase proteins was not toxic. 

These results demonstrate that the constitutive expression of both the polymerase and 
preterminal proteins is not detrimental to normal virus production, which might have occurred 
if one or both of the proteins had to be expressed only during a narrow time period during the 
Ad life cycle. In summary, these results demonstrated that the C-7 cell line can be used as a 
packaging cell line to allow the high level g r0 wi h of El. preterminal, and polymerase deleted 
Ad vectors. 

EXAMPLE 3 

Production Of Adenovirus Vectors 
Deleted For El And Polymerase Functions 

In order to produce an Ad vectors deleted for El and polymerase functions, a small, 
frame-shifting deletion was introduced into the Ad-pol gene contained within an El-deleted 
Ad genome. The plasmid pBHGl 1 (Microbix) was used as the source of an El -deleted Ad 
genome. pBHGl 1 contains a deletion of Ad5 sequences from bp 188 to bp 1339 (0.5-3.7 
m.u.): this deletion removes the packaging signals as well as El sequences. pBHGl 1 also 
contains a large deletion within the E3 region (bp 27865 to bp 30995: 77.5-86.2 m.u.). The 
nucleot.de sequence of pBHGl 1 is listed in SEQ ID NO:4 [for cross-corrleation between the 
pBHGl I sequence and the Ad5 genome (SEQ ID NO:l). it is noted that nucleotide 8.773 in 
pBHGl 1 is equivalent to nucleotide 7.269 in Ad5]. 

pBHGll was chosen to provide the Ad backbone because this plasmid contains a large 
deletion within the E3 region (77.5 to 86.2 m.u.) and therefore vectors derived from this 
plasmid permit the insertion of large pieces of foreign DNA. A large cloning capacity is 
important when the pol" vectors is to be used to transfer a large gene such as the dystrophin 
gene (cDNA = 13.6 kb). However, the majority of genes are not this large and therefore 
other Ad backbones containing smaller deletions within the E3 region {e.g.. pBHGIO which 
contains a deletion between 78.3 to 85.8 m.u.; Microbix) may be employed for the 
construction of pol" vectors using the strategy outlined below. 
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j) Construction Of A Plasmid Containing A Portion Of The 
Adenovirus Genome Containing The Polymerase Gene 

A fragment of the Ad genome containing the pol gene located on pBHGl I was 
subcloned to create pBSA-XB. Due to the large size of the Ad genome, this intermediate 

5 plasmid was constructed to facilitate the introduction of a deletion within the pol cene the pol 
deletion. pBSA-XB was constructed as follows. The polylinker region of pBluescript 
(Stratagene) was modified to include additional restriction enzyme recognition sites (the 
sequence of the modified polylinker is provided in SEQ ID NO:5: the remainder of 
pBluescript was not altered); the resulting plasmid was termed pBSX. pBHGl 1 was dieested 

10 with Xbal and BamHl and the 20.223 kb fragment containing the pol and pTP coding regions 
(E2b region) was inserted into pBSX digested with Xbal and BamHl to generate pBSA-XB. 

b) Construction Of pBHG 1 1 Apol 

A deletion was introduced into the pol coding region contained within pBSA-XB in 
15 such a manner that other key viral elements were not disturbed {e.g., the major late promoter, 
the tripartite leader sequences, the pTP gene and other leader sequences critical for normal 
virus viability). The deletion of the pol sequences was carried out as follows. pBSA-XB was 
digested with BspEl and the ends were filled in using T4 DNA polymerase. The BspEl- 
digested. T4 polymerase filled DNA was then digested with BamHl and the 8809 bp 
20 fl«mHI/Zfcy>EI(tlIled) fragment was isolated as follows. The treated DNA was run on a 0.6% 
agarose gel (TAE buffer) and the 8809 bp fragment was excised from the gel and purified 
using a QIAEX Gel Extraction Kit according to the manufacturer's instructions (OIAGER 
Chatsworth. CA). 

A second aliquot of pBSA-XB DNA was digested with BspHl and the ends were filled 
25 in with T4 DNA polymerase. The &vpHI-digested. T4 polymerase filled DNA was then 
digested with BamHl and the 13.679 bp BamHVBspHl( filled) fragment was isolated as 
described above. 

The purified 8809 bp Bam HVBspEK filled) fragment and the purified 13.679 bp 
5a/wHI/#.v/?Hl(ftlled) fragment were ligated to generate pApol. pApol contains a 612 bp 
30 deletion within the pol gene (bp 8772 to 9385; numbering relative to that of pBHGl 1) and 
lacks the 1 1.4 kb BamHl fragment containing the right arm of the Ad genome found within 
pBHGIl. 
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To provide ,he „ s h, arm or lhe Ad ?enom5 . p4po| _ di?esKd ~; 5 - 

lT7 f ' Ca "' in,eS ' ina, — P B HO' ' «- ^«,ed with 8amH , and 

II .4 kb ,ragt„e„, was isotaed and puri(ied us , n? a ^ &| ^^.^ ^ 

descr,bed above. The purified 1 1.4 kb fl„„,HI .ragmen, was Uua.ed ,„ ,„e 
/W,H„p„os P ha,ased pApol ,o 6 e„era,e pBHGI lApol. Proper construction of pBHG, Mpo, 
was conhrmed restriction digestion (ttndlll). 

c) Rescue And Propagation Of AdSApol Virus 

The Ad genome contained within pBHOI lApol !acks ,he packattintt signals 

PBHG, iApoi. , hls p ,a Sm , d must be colransfccled jmo packagmB k||s aiong wA 
prov, es a source of ,he Ad packagmg signals. The Ad packaging 5 i 8Ml s m ay be provided 
by w,,d.,vpe or ™„, Ad „ral DNA or ahernativety may be provided using a shuttle vector 
"Inch con.ains ,he ief.-end Ad5 sequences including ,he packagi„ a signals such as 
P4EISPIA. P AE,s P , B (Microbix, or pAdBglll (pAdBgl,! is a s,andard shunie vecor which 
con,a,ns 0- 1 m.u. and 9. 1 6 m.u. of ,he adenovirus genome). 

To rescue virus. pBHGI lApol was co-.ransfeced with Ad5<//70OI viral DNA 
Ado,™, contains a de.etion in the E3 re S ,„n : the E5 deietion contained within Ad5^700, 
« smaUer than ,he deicion contained within the Ad genome contained within pBHGI . ,« is 
no, necessary that AdMTOOl be used to recover virus: other adenoviruses, induding wild- 
type adenoviruses, may be used to rescue of virus from pBHGI lApol. 

I. has been reported that the generation of recombinant Ads is more efficient if Ad 
DNA,erm,„al protein compiex ,TPC, is employed ,„ conjunction with a plasmid comaining 
.he des,red dele,ion |Miyake „ „/. (,9%, Proc. NaU. Acad. Sci. USA 93:1320]. Accordingly 
Ad5 dl 700l-TPC were prepared as described (Miyake „ al ,,996). ,,„,„]. Briedv. purified 
Ad^/7001 virions (purified through an isopycnic CsCI gradient centered a, , 34 g/ ml ) were 
lysed by the addition of an equal volume of 8 M guanidine hydrochloride. The released 
Ad 5l //700l DNA-TPC was then purified through a buoyant densitv gradient of 7 8 M CsCIM 
M guanidine hydrochloride by centrifugation for 16 hr a, 55.0OO rpm in a VTi65 rotor 
(Beckman,. Gradient fractions containing AtiMTOOl DNA-TPC were identified using an 
e.h,d,u„, bromide spot test and then pooled, dialyzed extensively against TE buffer BSA was 
then added to a final concemra.ion of 0.5 mg/ml and aliquots were s,ored a, -80°C The 
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Ad5t//700i DNA-TPC was then digested with Seal and then gel-filtered throuuh a Sephadex 
G-50 spin column. 

One hundred nanograms of the digested Ad5<//700l DNA-TPC was mixed with 5- pa 
of pBHGI lApol and used to transfect pol-expressing 293 cells (i.e.. C-7V Approximately 10 

5 days post-transfection. plaques were picked. The recombinant viruses were plaque purified 
and propagated using standard techniques (Graham and Prevac. supra). Viral DNA was 
isolated, digested with restriction enzymes and subjected to Southern blottinu analysis to 
determine the organization of the recovered viruses. Two forms of virus containinc the 612 
bp pol deletion were recovered and termed Ad5ApolAE3I and Ad5ApolAE3U. One form of 

10 recombinant pol" virus recovered. Ad5ApolAE3I. underwent a double recombination event 
with the Ad5c//700l sequences and contains the E3 deletion contained within Ad5 J/7001 at 
the right end of the genome. The second form of recombinant poi~ virus. Ad5ApolAE3II. 
retained pBHGI 1 sequences at the right end of the genome (i.e.. contained the E3 deletion 
found within pBHGI I). These results demonstrate the production of a recombinant Ad vector 

15 containing a deletion within the pol gene. 

d) Characterization Of The E1+, pol- Viruses 

To demonstrate that the deletion contained within these two pol" viruses renders the 
virus incapable of producing functional polymerase. Ad5ApolAE31 was used to infect 293 

20 cells and pol-expressing 293 cells (B-6 and C-7 cell lines) and a viral replication- 
complementation assay was performed as described in Example le. Briefly. 293. B-6 and C-7 
cells were seeded onto 60 mm dishes at a density of 2 x 106 cells/dish and infected with 
H5.y?«A100 or Ad5ApolAE3I at an MOI of 0.25. The infected cells were then incubated at 
37°C or 38.5°C for 16 hours, or at 32.0°C for 40 hours. Cells from each infected plate were 

25 then harvested and total DNA was extracted. Four micrograms of each DNA sample was 
digested with HindllL electrophoresised through an agarose gel. transferred to a nylon 
membrane and probed with - : P-Iabeled adenoviral DNA. The resulting autograph is shown in 
Figure 9. In Figure 9, each panel shows, from left to right, DNA extracted from 293, B-6 
and C-7 cells, respectively infected with either H5sub\00 or Ad5ApolAE3l (labeled 

30 Ad5APOL in Fig. 9). 

As shown in Figure 9. the recombinant pol" virus was found to be viable on pol- 
expressing 293 cells but not on 293 cells. These results demonstrates that recombinant Ad 
viruses containing the 612 bp deletion found within pApol lack the ability to express Ad 
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express.on of ,a,e S ene produc.s was exam.ned. 293 and C-7 cells were ,„fec,ed wi,h 
AdMpol.B, and 24 hours af.er ,„fec,io„ ce„ ex„ac,s were prepared. The cel, ex,r«s were 
-a,,, u.ed and earned for , h e expression of ,He f.ber pro.ein ( a ,a,e e e„e prCuc, b 
,mmu„„ W o, analys, The immunoblo, was performed as desenbed in E.mp.e 3C wi.h ,he 
<xcep„o„ ihauhe prima. a„,,body used was an anfi-Hher an„body (FIBER-KNOB ob.ained 
fro, Rober, Gerard. U„iversi ly of Texas Sou.nwes.ern Med.ca, Schoo,, The resuhs of , s 
■mmunoblouin, ana.ysis revealed ,ha, no liber proiein was dceced from :o 3 cells infeced 
w„h Aa 5iip „, 4E 3, (at anv dilu , u , on 0 ,, he m|| exiraci) |n _ ^ 

of cel. ex.rac, prepared from C7 cells infec.ed wi, h Ad 5 Apo, AE 3, produced a visible band 
on ,he .mmunoblo,. Therefore. ,he po, detaion comained wuhin Ad 34poliE 3, resuhed in a 
grea.er .han 1000-fold decrease in fiber produc.ion. 

The above resuhs demons,™ iha, polymerase S e„e sequences can be deleied from ,he 
vrus and ,ha, ,he resul.in, delered v.rus w„, only grow on cells producn, Ad-polvmerase ,„ 
'ran,. Usmg ,he pol-expressi„ 8 cell lines descr.bed herein ley.. B-6 and C-7) large 
usuries of ,he por v.ruses can be prepared. A dramar.c shu.-down in g row,h and ,a,e gene 
express,o„ is seen when cells which do no, express Ad polymerase are mfeced wi,h ,he po|- 
viruses. 



e) Generation Of El Pol" Ad Vectors 

Ad5<//7001 used above to recuse virus containing the polymerase deletion is an El- 
conuumng virus. The presence of El sequences on the recombinant pel" viruses is 
undes.rable when the recombinant virus is to be used to transfer genes into the tissues of 
ammals: the El region encodes the transforming genes and such viruses replicate extreme* 
well ,„ l7W leading to an immune response directed against cells infected with the El- 
containing virus. 



E I " v.ruses containing the above-described polymerase deletion are senerated as 
follows. pBHGl I Apol is cotransfected into pol-expressing 293 cells (*.„.. B-6 or C-7) along 
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with a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 
Suitable shuttle vectors include pAElspIA (Microbix). pAElsplB (Microbix) or pAdBglll. 
The gene of interest is inserted into the polylinker region of the shuttle vector and this 
plasmid is then cotransfected into B-6 or C-7 cells along with pBHGl I Apol 10 szenerate a 
recombinant El", pol" Ad vector containing the gene of interest. 

EXAMPLE 4 

Production Of Adenovirus Vectors 
Deleted For El And Preterminal Protein Functions 

In order to produce an Ad vectors deleted for El and preterminal protein functions, a 
small deletion was introduced into the Ad preterminal protein (pTP) gene contained within an 
El -deleted Ad genome. The plasmid pBHGl 1 (Microbix) was used as the source of an El- 
deleted Ad genome to maximize the cloning capacity of the resulting pTP" vector. However, 
other Ad backbones containing smaller deletions within the E3 region (e.g., pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.; Microbix) may be employed for the 
construction of pTP" vectors using the strategy outlined below. 

a) Construction Of pApTP 

A deletion was introduced into the pTP coding region contained within pBSA-XB (Ex. 
3) in such a manner that other key viral elements were not disturbed (e.g.. the tripartite leader 
sequences, the i-leader sequences, the VA-RNA 1 and II genes, the 55 kD gene and the pol 
gene). The deletion of the pTP sequences was carried out as follows. pBSA-XB was 
digested with Xbal and EcoRV and the 7.875 kb fragment was isolated as described (Ex. 3). 
Another aliquot of pBSA-XB was digested with Muni and the ends were Filled in using T4 
DNA polymerase. The <V/wiI -digested. T4 polymerase filled DNA was then digested with 
Xbal and the 14.894 kb XbaVMunKfilled) fragment was isolated as described (Ex. 3). The 
7.875 kb Muni fragment and the 14.894 kb Xha\IMunl{ filled) fragment were ligated together 
to generate pApTP. 
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b) Construction Of pBHG U ApTP 

pApol contains a 429 bp deletion within the pTP gene (bp 10.705 to I 1.134: 
numbering relative to that of pBHGl I) and lacks the 11.4 kb flcwnHI. fragment containing the 
right arm of the Ad genome found within pBHGl 1. 

To provide the right arm of the Ad genome. pApTP was digested with BomHl 
followed by treatment with shrimp alkaline phosphatase (SAP; U.S. Biochemicals. Cleveland. 
OH). pBHGl 1 was digested with BamHl and the 1 1.4 kb fragment was isolated and purified 
using a QIAEX Gel Extraction Kit as described (Ex. 3). The purified 1 1.4 kb BamHl 
fragment was ligated to the S^iHI/phosphatased pApTP to generate pBHGl lApTP. Proper 
construction of pBHGl lApTP was confirmed restriction digestion. 

c) Rescue And Propagation Of Ad Vectors Containing The pTP 
Deletion 

The Ad genome contained within pBHGl lApTP lacks the Ad packaging signals. 
Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl lApol. this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 
pAElsplA. pAElsplB (Microbix) or pAdBglll. 

Recombinant Ad vectors containing the pTP deletion which contain a deletion within 
the E3 region are generated by cotransfection of pBHGl 1 ApTP (the gene of interest is 
inserted into the unique Pad site of pBHGl lApTP) with a E3-deleted Ad virus such as 
Ad5dl700 into pTP-expressing 293 cells (e.g.. C-7); viral DNA-TPC are utilized as described 
above in Example 3. 

Recombinant vectors containing the pTP deletion which also contain deletions within 
the EI and E3 regions are generated by cotransfection of pBHGl lApTP into pTP-expressing 
293 cells (e.g.. C-7) along with a shuttle vector containing the left-end Ad5 sequences 
including the packaging signals. Suitable shuttle vectors include pAElsplA (Microbix). 
pAElsplB (Microbix) or pAdBglll. The gene of interest is inserted into the polylinker region 
of the shuttle vector and this plasmid is then cotransfected into B-6 or C-7 cells along with 
pBHGl lApTP to generate a recombinant El". pTP" Ad vector containing the gene of 
interest. 
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EXAMPLE 5 

Production Of Adenovirus Vectors Deleted For 
El. Polymerase And Preterminal Protein Functions 

In order to produce an Ad vectors deleted for El. polymerase and preterminal protein 
functions, a deletion encompassing pol and pTP gene sequences was introduced into the Ad 
sequences contained within an EI -deleted Ad genome. The plasmid pBHGl IAE4 was used 
as the source of an El -deleted Ad genome to maximize the cloning capacity of the resulting 
pol". pTP" vector. pBHGl IAE4 is a modified form of BHG1 1 which contains a deletion of 
all E4 genes except for the E4 ORE 6; the E4 region was deleted to create more room for the 
insertion of a gene of interest and to further disable the virus. However, other El" Ad 
backbones, such as pBHGl 1 and pBHGl (Microbix; pBHGlO contains a smaller deletion 
within the E3 region as compared to pBHGll). may be employed for the construction of 
pol". pTP" vectors using the strategy outlined below. 

Due to the complexity of the cloning steps required to introduce a 2.3 kb deletion that 
removes portions of both the pTP and pol genes, this deletion was generated using several 
steps as detailed below. 

a) Construction Of pAXBApolApTPVARNA+tl3 

In order to create a plasmid containing Ad sequences that have a deletion within the 
pol and pTP genes, pAXBApolApTPVARNA+tl 3 was constructed as follows. 

pBSA-XB was digested with BspEl and the 18 kb fragment was isolated and 
recircularized to create pAXBApolApTP; this plasmid contains a deletion of the sequences 
contained between the BspEl sites located at 8.773 and 12.513 (numbering relative to 
pBHGl 1). 

A fragment encoding the VA-RNA3 sequence and the third leader of the tri-partite 
leader sequence was prepared using the PCR as follows. The PCR was carried out in a 
solution containing H5ts36 virion DNA (any Ad DNA. including wild-type Ad. may be used), 
2 ng/mL of primers 4005E and 4006E. 10 mM TrisHCl. pH 8.3. 50 mtvl KCI. 1.5 mM 
MgCL 0.001% gelatin and Pfu polymerase. The forward primer. 4005E, [5*- 
TGCCGCAGCACCGGATGCATC-3* (SEQ ID NO:6)j contains sequences complementary to 
residues 12.551 to 12.571 of pBHGl 1 (SEQ ID NO:4). The reverse primer. 4006E, [5*- 
GCGTCCGGAGGCTGCCATG CGGCAGGG-3' (SEQ ID NO:7)] is complementary to 
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res.dues 1 1.091 to 1 1. 1 08 of pBHGl 1 fSEQ ID NO:4) as well as a BspEl sue (underlined). 
The predicted sequence of the -1.6 kb PCR product is listed in SEQ ID NO:8. 

PCR was performed with a Perkin Elmer 9600 Thermocycler utilizing the following 
cycling parameters: initial denaturation at 94°C for 3 min. 3 cycles of denaturation at 94°C 
for 30 sec. annealing at 50°C for 30 sec. and extension at 72°C for 60 sec. followed by 
another 27 cycles with an increased annealing temperature at 56°C. with a final extension at 
72°C for 10 minutes. The -1.6 kb PCR product was purified using a QlAEX Gel Extraction 
Kit as described (Ex. 3). 

The purified PCR fragment was then digested with BspEl. pAXBApolApTP was 
digested with BspE\ followed by treatment with SAP. The fopEI-digested PCR fragment and 
the Z?.vpEI-SAP-treated pAXBApolApTP were ligated together to create 
pAXBApol ApTP V ARN A+t 1 3. 

b) Construction Of pBHGHApolApTPVARNA+tl3 

pAXBApolApTPVARNA+tl3 contains a 2.3 kb deletion within the pol and pTP genes 
(bp 8.773 to 1 1.091: numbering relative to that of pBHGl I) and lacks the 1 1.4 kb BamHl 
fragment containing the right arm of the Ad genome. 

To provide the right arm of the Ad genome. pAXBApolApTPVARNA+tl3 was 
digested with BamHl followed by treatment with SAP. pBHGl 1AE4 (pBHGll orpBHGlO 
may be used in place of pBHGl 1AE4) was digested with BamHl and the 1 1.4 kb fragment 
was isolated and purified using a QlAEX Gel Extraction Kit as described (Ex. 3). The 
purified 11.4 kb BamHl fragment was ligated to the flfl/wHI/phosphatased 
pAXBApolApTPVARNA+tl3 to generate pBHGl 1 ApolApTPVARMA+tl 3. Proper 
construction of pBHGl 1 ApolApTPVARNA+tl3 was confirmed restriction digestion. 

c) Rescue And Propagation Of Ad Vectors Containing The pol, 
pTP Double Deletion 

The Ad genome contained within pBHG 1 1 ApolApTPVARNA+t 1 3 lacks the Ad 
packaging signals. Therefore, in order to recover virus containing the 2.3 kb deletion within 
the pol and pTP genes from pBHGl lApolApTPVARNA+tl3. this plasmid must be 
cotranstected into packaging cells along with DNA that provides a source of the Ad 
packaging signals. The Ad packaging signals may be provided by wild-type or mutant Ad 
viral DNA or alternatively may be provided using a shuttle vector which contains the left-end 



- 45 - 



WO 98/17783 PCT/US97/ 19541 

Ad5 sequences including ihe packaging signals such as pAElsplA. pAEIsplB (Microbix) or 
pAdBglll. 

Recombinant vAd vectors containing the pol. pTP double deletion which also contain a 
deletion within the E3 region are generated by cotransfection of 
5 pBHGl i ApolApTPVARNA+tl3 (the gene of interest is inserted into the unique Pad site of 
pBHGI 1 ApolApTPVARNA+tl3) with a E3-deleted Ad virus such as Ad5rf/7001 into pol- and 
pTP-expressing 293 cells (e.g.. C-7); viral DNA-TPC are utilized as described (Ex. 3). 

Recombinant vectors containing the pol. pTP double deletion which also contain 
deletions within the El and E3 regions are generated by cotransfection of 
10 pBHGI lApolApTPVARNA+tl3 into pol- and pTP-expressing 293 cells (e.g.. C-7) alone with 
a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 
Suitable shuttle vectors include pAEIsplA (Microbix), pAElspIB (Microbix) or pAdBglll. 
The gene of interest is inserted into the polylinker region of the shuttle vector and this 
plasmid is then cotransfected into B-6 or C-7 cells along with 
15 pBHGI !ApolApTPVARNA+tl3 to generate a recombinant El", pol pTP" Ad vector 
containing the gene of interest. 

EXAMPLE 6 

Encapsidated Adenovirus Minichromosomes 
20 Containing A Full Length Dystrophin cDNA 

In this Example, the construction of an encapsidated adenovirus minichromosome 
(EAM) consisting of an infectious encapsidated linear genome containing Ad origins of 
replication, packaging signal elements, a (i-galactosidase reporter gene cassette and a full 

25 length (14 kb) dystrophin cDNA regulated by a muscle specific enhancer/promoter is 

described. EAMs are generated by cotransfecting 293 cells with supercoiled plasmid DNA 
(pAd5(idys) containing an embedded inverted origin of replication (and the remaining above 
elements) together with linear DNA from El -deleted virions expressing human placental 
alkaline phosphatase (hpAP). All proteins necessary for the generation of EAMs are provided 

30 in trans from the hpAP virions and the two can be separated from each other on equilibrium 
CsCI gradients. These EAMs are useful for gene transfer to a variety of cell types both in 
vitro and in vivo. 
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a> Generation And Propagation Of Encapsidated Adenovirus 
iMinichromosomes 

To establish a vector system capable of delivering full length dystrophin cDNA clones, 
the minimal region pi Ad5 needed for replication and packaging was combined with a 
conventional plasmid carrying both dystrophin and a p-galactosidase reporter gene. In the 
first vector constructed, these elements were arranged such that the viral ITRs Hanked (ITRs 
facing outward) the reporter gene and the dystrophin gene (/.c. the vector contained from 5" 
to y the right or 3" ITR (mu 100 to 99). the dystrophin gene and the reporter gene and the 
left or 5" ITR and packaging sequnce (mu I to 0)]. Upon introduction of this vector along 
with helper virus into 293 cells, no encapsidated adenovirus minichromosomes were 
recovered. 

The second and successful vector. P Ad5pdys (Fig. 10) contains 2. 1 kb of adenovirus 
DNA. together with a 14 kb murine dystrophin cDNA under the control of the mouse muscle 
creatine kinase enhancer/promoter, as well as a p-galactosidase gene regulated by the human 
cytomegalovirus enhancer/promoter. 

Figure 10 shows the structure of P Ad5pdys (27.8 kb). The two inverted adenovirus 
origins of replication are represented by a left and right inverted terminal repeat 
(LITR/RJTR). Replication from these ITRs generates a linear genome whose termini 
correspond to the 0 and 100 map unit (mu) locations. Orientation of the origin with respect 
to wild type adenovirus serotype 5 sequences in mu is indicated above the figure (1 mu=360 
bp). IZncapsidation of the mature linear genome is enabled by five (Al-AV) packaging signals 
( V H). The E. coli P-galactosidase and mus musculus dystrophin cDNAs are regulated by 
cytomegalovirus (CMV) and muscle creatine kinase (MCK) enhancer/promoter elements, 
respectively. Both expression cassettes contain the SV40 polyadenylation (pA) signal. Since 
the El A enhancer/promoter overlaps with the packaging signals. pAd5(Jdys was engineered 
such that RNA polymerase transcribing from the El A enhancer/promoter will encounter the 
SV40 late polyadenylation signal. Pertinent restriction sites used in constructing pAd5pdys 
are indicated below the figure. An adenovirus fragment corresponding to mu 6.97 to 7.77 
was introduced into pAd5pdys during the cloning procedure (described below). PI and P2 
represent location of probes used for Southern blot analysis. Restriction sites destroyed 
during the cloning of the Ad5 origin of replication and packaging signal are indicated in 
parentheses. 
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pAdipdys was constructed as follows. pBSX (Ex. 3a) was used as the backbone for 
construction of pAd5Pdys. The inverted fused Ad5 origin of replication and five 
encapsidation signals were excised as a Psti/Xbal fragment from pAdSori. a plasmid 
containing the 6 kb HindlU fragment from pFGI40 (Microbix Biosystems). This strategy also 

5 introduced a 290 bp fragment from Ad5 corresponding to map units 6.97 to 7.77 adjacent to 
the right inverted repeat (see Fig. 10). The £ coli p-galactosidase gene regulated by the 
human CMV immediate early (IE) promoter/enhancer expression cassette was derived as an 
EcoR\IHind\U fragment from pCMVp [MacGregor and Caskey (1989) Nucleic Acids Res. 
17:2365]; the CMV IE enhancer/promoter is available from a number of suppliers as is the £. 

10 coli p-galactosidase gene]. The murine dystrophin expression cassette was derived as a 
BssHU fragment from pCVAA. and contains a 3.3 kb MCK promoter/enhancer element 
[Phelps ei ai ( 1995) Hum. Mol. Genet. 4:1251 and Jaynes et ai (1986) Mol. Ceil. Biol. 
6:2855]. The sequence of the -3.3 kb MCK promoter/enhancer element is provided in SEQ ID 
NO:9 (in SEQ ID NO:9. the last nucleotide of SEQ ID NO:9 corresponds to nucleotide +7 of 

15 the MCK gene). The enhancer element is contained within a 206 bp fragment located -1256 
and -1050 upstream of the transcription start site in the MCK gene. 

It was hoped that the pAd5pdys plasmid would be packageable into an encapsidated 
minichromosome when grown in parallel with an El -deleted virus due to the inclusion of both 
inverted terminal repeats (ITR) and the major Ad packaging signals in the plasmid. The ITRs 

20 and packaging signals were derived from pFG140 (Microbix). a plasmid that generates EI- 
defective Ad particles upon transfection of human 293 cells. 

hpAP is an El deleted Ad5 containing the human placental alkaline phosphatase gene 
[Muller ei ai (1994) Circ. Res. 75:1039]. This virus was chosen to provide the helper 
functions so that it would be possible to monitor the titer of the helper virus throughout serial 

25 passages by quantitative alkaline phosphatase assays. 

293 cells were cotransfected with pAdSpdys and hpAP DNA as follows. Low passage 
293 cells (Microbix Biosystems) were grown and passaged as suggested by the supplier. Five 
pAdSpdys and hpAP DNA (5 and 0.5 ng, respectively) were dissolved in 70 (il of 20 mM 
HEPES buffer (pH 7.4) and incubated with 30 ^1 of DOTAP (BMB) for 15 min. at room 

30 temperature. This mixture was resuspended in 2 mis of DMEM supplemented with 2% fetal 
calf serum (FCS) and added dropwise to a 60 mm plate of 293 cells at 80% confluency. Four 
hours post-transfection the media was replaced by DMEM with 10% FCS. Cytopathic effect 
was observed 6-12 days post-transfection. 
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Cotransfection of 293 cells with supercoiled pAd5pdys and linear hpAP DNA 
produced Ad.5pd.ys EAMs (the encapsidated version of pAdSpdys) approximately 6 to 1 2 days 
post transfection. as evidenced by the appearance of a cyiopathic effect (CPE). Initially, the 
amount of Ad5pdys EAMs produced was significantly lower than that of hpAP virions so the 
viral suspension was used to re-infect fresh cultures, from which virus was isolated and used 
for serial infection of additional cultures (Fig. 1 1). Infection and serial passaging of 293 cells 
was carried out as follows. 

Lysate from one 60 mm plate of transfected 293 cells was prepared by vigorously 
washing the cells from the plate and centrifuging at 1 K rpm in a clinical centrifuge. Cells 
were resuspended in DM EM and 2% FCS. freeze-thawed in a dry ice-ethanol bath, cell debris 
removed by centrifugation. and approximately 75% of the crude lysate was used to infect 293 
cells in DM EM supplemented with 2% FCS for I hour and then supplemented with 10% FCS 
thereafter. Infection was allowed to proceed for 18-20 hrs before harvesting the virus. The 
total number of cells infected in each serial passage is indicated in Figure II. 

In Figure 1 1. the total number of transducing adenovirus particles produced (output) 
per serial passage on 293 cells, total input virus of either the helper (lipAP) or Ad5pdys. and 
the total number of cells used in each infection is presented. The total number of input/output 
transducing particles were determined by infection of 293 cells plated in 6-well microliter 
plates. Twenty four hours post-infection the cells were assayed for alkaline phosphatase or p- 
galactosidase activity (described below) to determine the number of cells transduced with 
either Ad5pdys or hpAP. The number of transducing particles were estimated by 
extrapolation of the mean calculated from 31 randomly chosen 2.5 mm : sectors of a 961 mnr 
plate. The intra-sector differences in total output of each type of virus are presented as the 
standard deviations, a. in Figure 11. For each serial passage. 75% of the total output virus 
from the previous passage was used for infection. 

Alkaline phosphatase or P-galactosidase activity was determined as follows. For 
detection of alkaline phosphatase, infected 293 cells on Petri dishes were rinsed twice with 
phosphate buffered saline (PBS) and fixed for 10 minutes in 0.5% glutaraldehyde in PBS. 
Cells were again rinsed twice with PBS for ten minutes followed by inactivation of 
endogenous alkaline phosphatase activity at 65°C for I hr. in PBS prior to the addition of the 
chromogenic substrate BCIP (5-bromo-4-chloro-3-indolyl phosphate) at 0.15 mg/ml and nitro 
blue tetrazolium at 0.3 mg/ml) Cells were incubated at 37°C in darkness for 3- 24 hrs. For 
P-galactosidase assays, the cells were fixed and washed as above, then assayed using standard 
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methods [MacGregor et al (1 99 1) In Murray (ed.). Methods in Molecular Biology; Vol ': 
Gene Transfer and Expression Protocols, Human Press Inc.. Clifton. NJ. pp. 217-225]. 

As shown in Figure 1 1. the rate of increase in titer of AdSpdys EAMs between 
transfection and serial passage 6 was approximately 100 times greater than that for hpAP 
virions. This result indicated that AdSpdys has a replication advantage over the helper virus, 
probably due to the shorter genome length (a difference of approximately 8 kb) and hence an 
increased rate of packaging. 

Interestingly, after serial passage 6 there was a rapid decrease in the total titer of hpAP 
virions whereas the titer of AdSpdys EAMs continued to rise. While not limiting the present 
invention to any particular mechanism, at least two possible mechanisms could be responsible 
for this observation. Firstly, a buildup of defective hpAP virions due to infections at hieh 
multiplicities may slowly out-compete their full length counterparts, a phenomenon that has 
been previously observed upon serial propagation of adenovirus [Daniell (1976) J. Virol. 
19:685: Rosenwirth et al (1974) Virology 60:431: and Burlingham et al (1974) Virology 
60:419]. Secondly, the emergence of replication competent virions due to recombination 
events between El sequences in cellular DNA and the hpAP genome could lead to a buildup 
of virus particles defective in expressing alkaline phosphatase [Lochmuller et al (1994) Hum. 
Gene Ther. 5:1485]. 

Southern analysis of DNA prepared from serial lysates 3. 6. 9 and 12 indicated that 
full length dystrophin sequences were present in each of these lysates (Fig. 12A). In addition, 
the correct size restriction fragments were detected using both dystrophin and p-galactosidase 
probes against lysate DNA digested with several enzymes (Figs. 12A-B). 

Figure 12 shows a Southern blot analysis of viral DNA from lysates 3. 6. 9 and 12. 
digested with the restriction enzymes ZksHlK Nrul and EcoRV. indicating the presence of a 
full length dystrophin cDNA in all lysates. Fragments from the C terminus of mus musculus 
dystrophin cDNA (A) or the N terminus of E coli P-galactosidase (B) were labeled with 
dCTP- ,: and used as probes [Sarnbrook et aL supra). The position of these probes and the 
predicted fragments for each digest is indicated in Figure 10. Note that one end of each 
fragment (except the 17.8 kb B.ssHll dystrophin fragment) detected is derived from the end of 
the linearized AdSpdys genome (see Fig. 10). Low levels of shorter products, presumably 
derived from defective virions, become detectable only at high serial passage number. 

At the later passages (9 and 12) there appeared to be an emergence of truncated 
AdSpdys sequences, suggesting that deletions and/or rearrangements may be occurring at later 
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passages. Hence, most experiments were performed with Adjpdys EAMs derived from the 
earlier passages. The possibility of the emergence of replication competent (El -containing) 
viruses was also examined by infection of HeLa cells by purified and crude serial lysates. 
none of which produced any detectable CPE. 

b) Purification Of Encapsidated Adenovirus Minichromosomes 
On CsCl Gradients 

The ability to separate Ad5pdys EAMs from hpAP virions based on their buoyancy 
difference (due presumably to their different genome lengths) on CsCl gradients was 
10 examined. Repeated fractionation of the viral lysate allows small differences in buoyancy to 
be resolved. CsCl purification of encapsidated adenovirus minichromosomes was performed 
as follows. Approximately 25% of the lysate prepared from various passages during serial 
infections was used to purify virions. Freeze-thawed lysate was centrifuged to remove the 
cell debris. The cleared lysate was extracted twice with 1.1.2 tricholorotrifluoroethane 
15 (Sigma) and applied to CsCl step and self forming gradients. 

Purification of virus was initially achieved by passing it twice through CsCl step 
gradients with densities of p=1.45 and p=1.20 in a SW28 rotor (Beckman). After isolation of 
the major band in the lower gradient, the virus was passed through a self forming gradient 
(initial p=l .334) at 37.000 rpm for 24 hrs followed by a relaxation of the gradient by 
20 reducing the speed to 10.000 rpm for 10 hrs. in a SW41 rotor (Beckman) at 12°C [Anet and 
Strayer (1969) Biochem. Biophys. Res. Commun. 34:328]. The upper band from the gradient 
(composed mainly of Ad5pdys virions) was isolated using an 18 gauge needle, reloaded on a 
„ fourth CsCl gradient (p= 1.334) and purified at 37.000 rpm for 24 hrs followed by 10.000 rpm 
, for 10 hrs at 12°C. 

25 ,,, The AdSpdys-containinc CsCl band was removed in 100 ul fractions from the top of 

the centrifugation tube and CsCl was removed by chromatography on Sephadex G-50 
(Pharmacia). Aliquots from each fraction were used to infect 293 cells followed by 0- 
,. galactosidase and alkaline phosphatase assays to quantitate the level of contamination by 
hpAP virions in the final viral isolate. 

-'° Results of the physical separation between Ad5pdys EAMs and hpAP virions are 

shown in Figure 13. Figure 13 shows the physical separation of Ad5Pdys from hpAP virions 
at the third (A) and final (B) stages of CsCl purification (initial p= 1.334) in a SW4I tube. 
Aliquots of AdSpdys EAMs from the final stage were drawn through the top of the 
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centrifugation tube and assayed for [3-galactosidase and alkaline phosphatase expression. 
Results of these assays are presented in Figure 14. 

In Figure 14. shows the level of contamination of Adipdys EAMs by hpAP virions 
obtained from passage 6. Following four cycles of CsCI purification, aliquots were removed 
from the top of the centrifugation tube and used for infection of 293 cells, which were fixed 
24 hrs later and assayed for P-galactosidase or alkaline phosphatase activity (described below). 
The number of transducing particles are an underestimation of the actual totals as in some 
cases a positive cell may have been infected by more than one transducing particle. The ratio 
of the two types of virions - AdSpdys EAMs (LacZ) or hpAP (AP) in each fraction is 
indicated in the lower graph. 

The maximum ratio of transducing AdSpdys EAMs to hpAP virions reproducibly 
achieved in this study was 24.8 -- a contamination with helper virus corresponding to 
approximately 4% of the final viral isolate. 

c) Dystrophin And /3-galactosidase Expression By Encapsidated 
Adenovirus Minichromosomes In Muscle Cells 

To determine if the AdSpdys EAMs were able to express p-galactosidase and 
dystrophin in muscle cells, mouse mdx myogenic cultures were infected with CsCI purified 
EAMs. 

i) Propagation And Infection Of Muscle Cells 

MM 14 and mdx myogenic cell lines were kindly provided by S. Hauschka (University 
of Washington) and were cultured as previously described [Linkhart.e/ cti (1981) Dev. Biol. 
86:19 and Clegg et al. (1987) J. Cell Biol. 105:949]. Briefly, myoblasts were grown on 
plastic tissue culture plates coated with a 0.1% gelatin in Ham's F-10 medium containing 15% 
(v/v) horse serum, 0.8 mM CaCl-u 200 ng/ml recombinant human basic fibroblast growh 
factor (b-FGF) and 60 ng/ml genitimicin (proliferation medium). Cutures were induced to 
differentiate by switching to growth in the presence of growth medium lacking b-FGF and 
containing 10% horse serum (differentiation medium). Myoblasts or differentiated myotubes 
(three days post switching) were infected at a multiplicity of infection of 2.2 Ad5pdys EAMs 
per cell. Fractions containing minimal contamination with hpAP virions (3. 4 and 5 of 
passage 6) were used for western and immunofluorescence analysis. Infection was allowed to 
proceed for 3 days for both the myoblasts and myotubes before harvesting cells. 
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As shown in Figure 15. expression of (3-galaciosidase was detected in both the infected 
mdx myoblasts and myotubes indicating that the CMV promoter was active at both these earlv 
stages of differentiation in muscle cells. However, only infected mdx myotubes produced 
protein detected by dys-2. an antibody recognizing the 17 C-terminal amino acids of 
dystrophin (Fig. 15). No dystrophin expression was detected in infected myoblasts by western 
analysis, indicating that the muscle creatine kinase promoter functions minimally, if at all. 
within the AdSpdys EAM prior to terminal differentiation of these cells. 

Dystrophin expression in mdx cells infected with EAMs was confirmed by 
immunofluorescence studies using N-terminal dystrophin antibodies. In agreement with the 
western analysis, dystrophin expression from the MCK promoter was detected only in 
differentiated mdx myotubes infected by AdSpdys EAMs (Fig. 16). Figures I6A-C show 
immunofluorescence of dystrophin in wild type MM14 myotubes (A), uninfected mdx (B) and 
infected mdx myotubes (C), respectively. The results shown in Figure 16 demonstrate the 
transfer and expression of recombinant dystrophin to differentiated mdx cells by AdSpdys 
1 5 EAMs. 

Immunofluorescence of myogenic cells was performed as follows. Approximately 1.5 
x 10" MM14 or mdx myoblasts were plated on Poly-L-lysine (Sigma) coated glass slides (7 x 
3 cm) which had been previously etched with a 0.05% chromium potassium sulfate and 0.1% 
gelatin solution. For myotube analysis, the cultures were switched to differentiation media 

20 [Clegg el «/. (1987). supra] 48 hours after plating, immediately infected and then allowed to 
fuse for 3 days, whereas myoblasts were continuously propagated in proliferation media 
[Clegg ei al. (1987). supra]. Cells were washed three times with PBS at room temperature 
and fixed in 3.7% formaldehyde. For immunostaining. cells were incubated in 0.5% Triton 
X-100. blocked with 1% normal goat serum and incubated with an affinity purified antibody 

25 against the N-terniinus of murine dystrophin for 2 hrs. followed by extensive washing in PBS 
and 0.1% Tween-20 with gentle shaking. Cells were incubated with a 1:200 dilution of biotin 
conjugated anti-rabbit antibody (Pierce) for one hour and washed as above. Cells were 
further incubated with a 1:300 dilution of streptavidin-fiuorescein isothiocyanate conjugate 
(Vectorlabs. Burlingam. CA) for one hour and washed as above, followed by extensive 

30 washing in PBS. 

The above results show that embedded inverted Ad origins of replication coupled to an 
encapsidation signal can convert circular DNA molecules to linear forms in the presence of 
helper virus and that these genomes can be efficiently encapsidated and propagated to high 
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cross-hatching. The core promoter element is required in addition to the minimal promoter 
element (which is required for basal expression) in order to achieve increased muscle-specific 
expression in tissue culture [Shield a al. (1996) Mol. Cell. Biol. 16:5058]. The modified 
enhancer element, the 2RS5 enhancer, is indicated by the box labeled "EV The box labeled 
"mi nx" contains a synthetic intron derived from adenovirus and is used to increase expression 
of the constructs (any intron may be utilized for this purpose). The box labeled "CMV 
depicts the CMV IE enhancer/promoter which was used a positive control. 

In figure 17. P-galactosidase activity is expressed relative to the activitv of the wild- 
type MCK. enhancer/promoter construct shown at the top of the figure. As shown in Figure 
17. a construct containing the 2RS5 enhancer (SEQ ID NO: 10) linked to either the minimal 
MCK promoter (a - 261 bp element) or the core and minimal MCK promoter elements (a -539 
bp element) directs higher levels of expression of the reporter gene in muscle cells as 
compared to the -3.3 kb fragment containing the wild-type enhancer element. These modified 
enhancer/promoter elements are considerably smaller than the -3.3 kb fragment used in the 
Ad5|3dys EAM plasmid and are useful for directing the expression of foreign genes in muscle 
cells. These smaller elements are particularly useful for driving the expression of genes in the 
context of self-propagating adenoviral vectors which have more severe constraints on the 
amount of foreign DNA which can be inserted in comparison to the used of "gutted" 
adenoviruses such as the EAMs described above. 

EXAMPLE 7 

Generation Of High Titer Stocks Of Encapsidated Adenovirus 
Minichromosomes Containing Minimal Helper Virus Contamination 

The results presented in Example 6 demonstrated that encapsidated adenovirus 
minichromosomes (EAMs) can be prepared that lack all viral genes and which can express 
full-length dystrophin cDNAs in a muscle specific manner. The propagation of these EAMs 
requires the presence of helper adenoviruses that contaminate the final EAM preparation with 
conventional adenoviruses (about 4% of the total preparation). In this example the EAM 
system is modified to enable the generation of high titer stocks of EAMs with minimal helper 
virus contamination. Preferably the EAM stocks contain helper virus representing less than 
1%. preferably less than 0.1% and most preferably less than 0.01% of the final viral isolate. 
Purified EAMs are then injected in vivo in muscles of dystrophin minus mdx mice to 
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Alternatively. EAMs may be prepared using a packaging cell line that supports high 
efficiency Cre recombinase-mediated excision of the packaging signals from the helper virus 
by transfection of the 0*-expressing cell line with the EAM plasmid followed by infection of 
these cells with /o.v/'-contaning helper virus using an MOI of > 1.0. 
:> Human 293 cell lines that express a variety of levels of Cre recombinase are generated 

as follows. An expression vector containing the Cre coding region. pOG231. was 
cotransfected into 293 cells along with pcDNA3 (Invitrogen); pcDNA3 contains the neo gene 
under the control of the SV40 promoter. pOG23l uses the human CMV enhancer/promoter 
[derived from CDM8 (Invitrogen)] to express a modified Cre gene [pOG231 was obtained 

10 from S. O'Gorman]. The modified Cre gene contained within pOG231 has had a nuclear 

localization signal inserted into the coding region to increase the efficiency of recombination. 

pOG231 was constructed as follows. A Bglll site was introduced into the 5' Xbal site 
of the synthethic intron of pMLSISCAT [Haung and Gorman (1990) Nucleic Acids Res. 
18:937] by linker tailing. A BglU site in the synthethic intron of pMLSISCAT was destroyed 

15 and a BamHl linker was inserted into the Pstl site at the 3* end of the synthethic intron in 

pMLSISCAT. Bglll and Smol sites and a nuclear localization signal were introduced into the 
5* end of pMC-Cre [Gu et ai (1993) Cell 73:1 155] using a PCR fragment that extended from 
the novel Bglll and Smal sites to the BamHl site in the Cre coding region. This PCR 
fragment was ligated to a BamHIISall fragment containing a portion of the Cre coding region 

20 derived from pIC-Cre (Gu et ai. supra) and the intron plus Cre coding sequence was inserted 
into a modified form of pOG44 [O'Gorman et ai (1991) Science 251:1351] to generate 
pOG231. The predicted sequence of pOG231 from the Bglll site to the BamHl site located in 
the middle of the Of coding sequence is listed in SEQ ID NO:l I. 

One 60 mm dish of 293 cells (Microbix) were transfected with 10 jig of Pvulh 

25 linearized pOG231 and 1 |ig of /Vo/Uinearized pcDNA3 using a standard calcium phosphate 
precipitation protocol. Two days after the addition of DNA. the transfected cells were split 
into three 100 mm dishes and 1000 ng/ml of active G418 was added to the medium. The 
cells were fed periodically with G418-contaning medium and three weeks later. 24 G418- 
resistant clones were isolated. 

30 The isolated clones were expanded for testing. Aliquots were frozen in liquid nitrogen 

at the earliest possible passage. The neomycin resistant cell lines were examined for the 
expression of Cre recombinase using following transfection assay. The neomycin resistant 
cells were transfected with PGK-l-GFP-lacZ (obtained from Sally Camper. Univ. of 
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M,ch, g an. Ann Arte. ,V„,. which con.ains a green Huorescen, pro,e,„ |CF P, expression 
casseuc .ha, can be excised bv C„ recombinase ,o *, expression of P , ralK1 „sidas, 
.rans.econ was accomphsed us,n g s,a„dard calcium phospna.c preci P i,a,io„. Fieure ,8 
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'* in "— «"« W «K«« s and P- a al „.,. , IIC Z) sequences are 

ava,a,b=le f rom commmerical sources ,„.,. done.ech. Pa,o AI,o. CA and Pharmacia Bicech 

Piscataway, NJ. respectively). 

Control experiments verified that 293 cells transfected with PGK-l-GFP-.acZ 
expressed significant amounts of P - ga |actosidase only if these cells also expressed Cre 
recombinanse. N a.actos,dase assays were performed as described in Example 6 Neomycin 
res,stant cells expressing Cre recombinase were grouped as high, medium or low expressors 
based upon the amount of P-ga.actosidase activity produced (estimated bv direct countin. of 
P-galactosidase-posiuve cells per high-power field and by observing the .eve. of staining 
the mp-dity with which the blue stain was apparent) when these cel. .i nes were transfected 
w,th the GFP/ P - g a. reporter construe, Thineen positive (,,, Oe-expressi nc) , In es. D608*P 
*7. #22. *I8. #17. *4. W . #2 . U2,n #I3 . M . #I5 and m ^ ^ ^ 

The results of this transection analysis revealed that cultures of 293 cells expressing 
medium to high levels of Cre recombinase could be generated without apparent toxicity. 

b) Generation Of Helper Adenovirus Strains That Contain loxP 
Sites Flanking The Adenovirus Packaging Signals 

Studies of EAM production demonstrated that the EAM vector has a packaging 
advantage over the helper adenovirus (Ex. 6). While not limiting the present invention to a 
part.cu.ar mechanism, it is hypothesized that this packaging and rep.ication advantage can be 
greatly increased by using helper viruses that approach the packaging size limits of Ad5 [Bett 
ei ai (.993) J. Virol. 67:591 1], by using viruses with mutations in E4 and/or E 7 genes [Yeh 
« «/■ < 1996) J. Viro,. 70:559: Gorzig.ia „ al. (.996) J. V iro .. 70:4.73: and Ama.fitano « al 
(1996) Proc. Natl. Acad. Sci. USA 93:3352]. by inclusion of mutations or alterations in the 
packaging signals of the helper virus [Imler ei al. (.995) Hum. Gene Ther. 6:71 1] and by 
combining these strategies. 

The Cre-loxP excision method is used to disable the packaging signals from the helper 
v.rus genomes. The Ad5 packaging domain extends from nucleotide .94 to 358 and is 
composed of five distinct elements that are functionally redundant [Hearine el al (1987) j 
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Virol. 61:2555]. Theoretically, any molecule containing the Ad5 origin of replication and 
packaging elements should replicate and be packaged into mature virions in the presence of 
non-defective helper virus. Disabling the packaging signals should allow replication and sene 
expression to proceed, but will prevent packaging of viral DNA into infectious particles. This 
5 in turn should allow the ratio of EAM to helper virus to be increased greatly. 

To disable the packaging signals within the helper virus used to encapsidate the EAtVls, 
the loxP sequences are incorporated into a helper virus that has a genome approaching the 
maximal packaging size for Ad [-105 map units: Bett ct ai (1993). supra) which further 
decreases the efficiency of helper virus packaging. Final virus size can be adjusted by the 

10 choice of introns inserted into the reporter gene, by the choice of reporter genes, or by 
including a variety of DNA fragments of various sizes to act as "sniffer" fragments. A 
convenient reporter gene is the alkaline phosphatase gene (see Ex. 6). The optimized Cre- 
loxP system is also incorporated into a helper viral backbone containing disruptions of any or 
ail of the E1-E4 genes. Use of such deleted genomes requires viral growth on appropriate 

15 complementing cell lines, such as 293 ceils expressing E2. and/or E4 gene products. The 
loxP sequences are incorporated into the helper virus by placing the loxP 
sequences on either side of the packaging signals on a shuttle vector. This modified shuttle 
vector is then used to recombine with the Ad DNA derived from a virus containing 
disruptions of any or all of the E1-E4 genes to produce the desired helper virus containing the 

20 packaging signals flanked by loxP sequences. 

pAdBglll. an adenovirus shuttle plasmid containing Ad sequences from 0-1 and 9-16 
map units (mu) was used as the starting material. Synthetic oligonucleotides were used to 
create a polylinker which was inserted at 1 mu within pAdBglll as follows. The Bglll LoxP 
oiigo [5*-GAAGATCTATAACTTCGTATAATGTATGCTA 

25 TACGAAGTTATTACCGAAGAAATGGCTCGAGATCTTC-3' (SEQ ID NO: 12) and its 
reverese complement [5*-GAAGATCTCGAGCCATTTCTTCGGTAATAACTTCGT 
ATAGCATACATTATACGAAGTTATAGATCTTC-3 , (SEQ ID NO: 13)] were synthesized. 
The AflHI LoxP oligo [5 ? -CCACATGTATAACTTCGTATAGCATACA 
TT ATACG A AGTT AT AC ATGTGG-3 : (SEQ ID NO: 14)] and its reverse complement [5*- 

30 CCACATGTATAACTTCGTATAATGTATGCTATACGAAGTTATACATG TGG-3' (SEQ 
ID NO: 15)] were synthesized. The double stranded form of each loxP oligonuceotide was 
digested with the appropriate restriction enzyme (e.g.. Bglll or /1//III) and inserted into 
pAdBglll which had been digested with Bglll and Afllll. This resulted in the insertion of 



- 60 - 



W ° 98/17783 PCT/US97/,9541 

LoxP sequences ,mo .he shuttle vector Hanking the packaging signals that are located between 
0.5 and I mu of Ad5 (one mu equals 360 bp in the Ad5 genome: the sequence of the Ad5 
genome ,s Isi.ed in SEQ ID NO: I). The y loxP sequence was inserted into the BgM site 
within pAdBglll. The 5" loxP sequence was inserted into .he AflW site located at tese 143 
(-0.8 mu) (numbering relative to Ad5). 

DNA sequencing was used to verify the final structure of the modified shuttle plasmid 
between 0-1 mu. If Cre recombinase-mediated excision of the packaging signals is found to 
be too efficient (as judged by the production of too little helper virus), alternate sites for the 
insertion of the loxP sequences are used that would result in deletion of 1 3 or 4 packaging 

. sequences rather than all 5 [Grable and Hearing (1990) J. Virol. 64:2047]. The insertion of 
loxP sequences at sites along the Ad genome contained within the shuttle vector which would 
result in the deletion 2. 3 or 4 packaging sequences are easily made using the technique of 
recombinant PCR [Higuchi (1990) In: PCR Protocols: A Guide to Methods and 
Applications. Innis et al. (eds.) Academic Press. San Diego. CA. pp. 177-183]. The optimal 
amount of Cre recombinase-mediated excision of the packaging signals is that amount which 
permits the production of enough packaged helper virus to permit the slow spread of virus on 
the first plate of cells co-.ransfected with helper virus DNA (containing the loxP sequences) 
and the EAM vector. This permits serial passage of the EAM preparation onto a subsequent 
lawn of cells to increase the titer of the EAM preparation. Alternatively, if Cre recombinase- 
mediated excision of the packaging signals is essentially 100% efficient in all Cre-expressing 
cells lines (i.e.. regardless of the level of ^-expression, that is even a cell line expressing a 
low level of Cre as judged by the GFP/p-gal assay described above), the packaged EAMs 
may be used along with helper virus (used at a MOI of -1.0) to infect the second or 
subsequent lawn of cells to permit serial passaging to increase the titer of the EAM 

. preparation. 

Following introduction of the loxP sequences into the shuttle vector, the human 
placental alkaline phosphatase (HpAp) cDNA under control of the RSV promoter was inserted 
into the polylinker to provide a reporter gene for the helper virus. This is the same reporter 
used previously during EAM generation (Ex. 6). The HpAp sequences were inserted as 
follows. The /arP-containing shuttle vector was linearized with XJw\ and the hpAp cassette 
was ligated into the Xho\ site (a Xho\ site was inserted into pAdBglll during the insertion of 
the loxP sequences as a Xlw\ site was located on the 3" end of the loxP sequences inserted 
into the %/fl site of pAdBglll). The HpAp cassette was constructed as follows. 
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pRSVhAPT40 (obtained from Gary Nabel. Univ. of Michigan. Ann Arbor. Ml) was digested 
with EcoRl to generate an EcoRl fragment containing the HpAP cDNA and the SV40 intron 
and polyadenylation sequences. pRc/RSV (Invitrogen) was digested with Hindlll. then 
partially digested with EcoRl. A 5.208 bp fragment was then size selected on an agarose gel. 

5 treated with calf alkaline phosphatase, and ligated to the EcoRl fragment derived from 

pRSVhAPT40 to generate pRc/RSVAP. pRc/RSVAP was then digested with Sail and Xhol to 
liberate the RSV promoter linked to the HpAP cDNA cassette (including the SV40 intron and 
polyadenylation sequences). This Sal\-Xho\ fragment was inserted into the /av/'-containing 
shuttle vector which had been digested with Sail and Xhol to generate pADLoxP-RSVAP. 

10 Helper virus containing loxP sites flanking the Ad packaging signals is generated by 

co-transfection of the LoxP shuttle plasmid (pADLoxP-RSVAP) and C/al-digested 
Ad5t//7001 DNA into 293 cells [Graham and Prevec (1991) Manipulation of Adenovirus 
Vectors. In Methods in Molecular Biology, Vol. ". Gene Transfer and Expression Protocols. 
Murray (ed.). Humana Press Inc.. Clifton. NJ. pp. 109-128]. Figure 19 provides a schematic 

15 showing the recombination event between the loxP shuttle vector and the Ad5t//70OI genome. 
Co-transfection is carried out as described in Example 3c. 

Alternatively, the reporter gene may be inserted into the E3 region of pBHGlO or 
pBHGI 1 (using the unique Pad site) rather than into the polyl inker located in the El region 
of the shuttle vector. The reporter gene-containing pBHGIO or 1 1 is then used in place of 

20 Ad5t//7001 for cotransfection of 293 cells along with the loxP-containing pAdBglll. 

Following cotransfection, recombinant plaques are picked, plaque purified, and tested 
for incorporation of both hpAp and loxP sequences by PCR and Southern analysis (Ex. 3). 
Viruses which contain loxP sites flanking the packaging signals and the marker gene (hpAp) 
are retained, propagated and purified. 

25 The isolated helper virus containing loxP sites flanking the packaging signals and the 

marker gene is then used to infect both 293 cells and the 293 cell lines that express Cre 
(section a. above). Cre recombinase-expressing cell lines that produce optimal levels of Cre 
recombinase when infected with the /atP-containing helper virus are then used for the 
generation of EAMs as described below. 

30 
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c) Generation Of Encapsidated Adenovirus Minichromosomes 
That Express Dystrophin 

The growth of EAMs is optimized using the following methods. In the first method, 
plasmid DNA from the EAM vector (Ex. 6) are co-transtected into 293 cells with purified 
viral DNA from the helper virus, and viruses are harvested 10-14 days later after appearance 
of a viral cytopathic effect (CPE) as described in Example 6. This approach is simple, vet 
potentially increases helper virus levels by allowing viral spread throughout the culture dishes. 

In the second method, the co-transfected 293 cells are overlaid with agar and single 
plaques are picked and tested for EAM activity by the ability to express P-galactosidase 
activity following infection of Hela cells (Ex. 6). This second approach is more time 
consuming, but will result in less contamination by helper. 

The ability of these two methods to generate EAMs are directly compared. The 
preferred method is that which produces the highest titer of EAM with the lowest 
contamination of helper virus. The efficiency of EAM generation may also be increased by 
using helper virus DNA that retains the terminal protein (TP) (i.e.. the helper virus is used as 
a viral DNA-TPC as described in Ex. 3) The use of viral DNA-TPC has been shown to 
increase the efficiency of viral production following transfection of 293 cells by an order of 
magnitude [Miyake at al. (1996). .supra]. 

The initial transfection will utilize 293 cells that do not express Cre recombinase. so 
that efficient spread of the helper can lead to large scale production of EAM. The method 
producing the highest ratio of EAM to helper virus is then employed to optimize conditions 
for the serial propagation of the EAM on 293 cells expressing Cre recombinase. This 
optimization is conducted using cell lines expressing different levels of Cre recombinase. The 
following variables are tested: I) the ratio of input viral titer to cell number. 2) the number 
of serial passages to use for EAM generation, 3) the use of cell lines producing different 
levels of Cre to achieve the optimal ratio between high EAM titer and low helper titer. 4) 
continuous growth on Ox-producing 293 cells versus alternating between Cn?-producing cells 
and the parental 293 cells, or to alternate between high and low Ge-producing cells and 5) 
CsCl purification of EAMs prior to re-infection of 293 cells increases the ratio of the final 
EAM/helper titers. The protocols that result in the highest yield of EAM with minimal helper 
virus are used to generate large volumes of crude viral lysates for purification by density 
gradient centrifugation (Ex. 6). 
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EAMs were purified from helper virus using standard CsCi density gradient * 
centrifugation protocols in Example 6 and resulted in preparations containing -4% helper virus 
In order to improve the physical separation of EAMs and helper virus, a varietv of different 
centrifugation conditions are possible, including changing the gradient shape, type of rotor and 
tube used, combinations of step and continuous gradients, and the number of gradients used. 
Materials with better resolving powers than CsCl can also be employed. These include 
rubidium chloride and potassium bromide [Reich and Zarybnicky (1979) Annal. Biochem. 
94:1931. 



d) Use Of EAMs Encoding Dystrophin For Long Term 
Expression Of Dystrophin in Muscle 

To demonstrate the ability of the purified dystrophin expressing EAMs (prepared as 
described above) to deliver and express dystrophin in the muscle of an animal, the following 
experiment is conducted. First, purified dystrophin-EAMs are delivered to the muscle by- 
direct intramuscular injection into newborn. 1 month, and adult (3 month) mouse quadriceps 
muscle. The dystrophin expressing EAM AdSpDys are injected into mdx mice and into 
transgenic mdx mice that express P-galactosidase in the pituitary gland [Tripathy et ai f 1996) 
Nature Med. 2:545]. These latter mice are used to avoid potential immune-rejection of cells 
expressing p-galactosidase from the Ad5pDys vector. An alternate EAM lacking the P- 
galactosidase reporter gene may also be employed; however, the presence of the 0-Gal 
reporter simplifies EAM growth and purification (vectors lacking p-Gal have their purity 
estimated by PCR assays rather than by p-Gal assays). 

Following intramuscular injection of EAM. animals are sacrificed at intervals between 
1 week and 6 months to measure dystrophin expression [by western blot analysis and by 
immunofluorescence (Phelps et ai (1995) Hum. Mol. Genet. 4:1251 and Rafael et ai (1996) 
J. Cell Biol. 134:93] and muscle extracts will also be assayed for p-Gal activity [MacGregor 
et ai (1991). supra]. These results are compared with previous results obtained using current 
generation viral vectors (i.e.. containing deletions in El and E3 only) to demonstrate that 
EAMs improve the prospects for long term gene expression in muscle. 
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mdx mice display a 40% reduction in force and power generation compared with wild-type 
mice, while 6 month animals display a greater than 50% reduction (Cox e/ id. (1993) Nature 
364:725; Rafael a( id. (1994). supra: Phelps et al. (1995). supra: and Corrado et al. (1996) J. 
Cell. Biol. 134:873]. 



EXAMPLE 8 

Improved Shuttle Vectors For The 
Production Of Helper Virus Containing LoxP Sites 



In the previous example, a shuttle vector containing adenoviral sequences extendine 
from 9 mu to 16 mu of the Ad5 genome was modified to contain loxP sequences surroundine 
the packaging signals (the LoxP shuttle vector). This modified shuttle vector was then 
recombined with an Ad virus to produce a helper virus containing the loxP sequences. This 
helper virus was then used to infect cells expressing Cre recombinase along with DNA 
comprising a minichromosome containing the dystrophin gene, a reporter gene and the 
packaging signals and ITRs of Ad in order to preferentially package the minichromosomes. 
Using this approach the helper virus, which has had the packaging signals removed by Cre- 
loxP recombination, contains the majority of the Ad genome (only a portion of the E3 region 
is deleted). Thus, if low levels of helper virus are packaged and appear in the EAM 
preparation, the EAM preparation has the potential of passing on helper virus capable of 
directing the expression of Ad proteins in cells which are exposed to the EAM preparation. 
The expression of Ad proteins may lead to an immune response directed against the infected 
cells. 

Another approach to reducing the possibility that the EAM preparation contains helper 
virus capable of provoking an immune response is to use helper viruses containing deletions 
and/or mutations within the pol and pTP genes. Helper virus containing a deletion in the pol 
and/or pTP genes is cotransfected with the EAM construct into 293-derived cell lines 
expressing pol or pol and pTP to produce EAMs. Any helper virus present in the purified 
EAM preparation will be replication defective due to the deletion in the pol and/or pTP genes. 
As shown in Example 3, viruses containing a deletion in the pol gene are incapable of 
directing the expression of viral late genes; therefore, helper viruses containing a deletion in 
the pol gene or the pol and pTP genes should not be capable of provoking an immune 
response a CTL response) against late viral proteins synthesized de novo. Shuttle 
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signals as outlined in Example 7. The /av/^containing pol" or poi". pTP" shuttle vectors 
(pTP- shuttle vectors may also be employed) are cotransfected into 293 cells expressine Cre 
recombinase and pol or Cre recombinase. pol and pTP. respectively along with an appropriate 
El" viral DNA-TPC (the El- viral DNA may also contain deletions elsewhere in the genome 
5 such as in the E4 genes or in the E2a gene as packaging cell lines expressing the E4 ORf 6 
or 6/7 and lines expressing E2a genes are avaialble) to generate helper virus containing loxP 
sites flanking the packaging signals as well as a deletion in the pol gene or the pol and pTP 
genes. The resulting helper virus(es) is used to cotransfect 293 cells expressing pol or pol 
and pTP along with the desired EAM construct. The resulting EAM preparation should 

10 contain little if any helper virus and any contaminating helper virus present would be 

replication defective and incapable of expressing viral late gene products. Helper viruses 
containing loxP sequences and deletions in all essential early genes may be employed in 
conjunction with Cre recombinanse-expressing cell lines expressing in irons the EL E4 ORF 
6. E2a. and E2b (e.g.. Ad polymerase and pTP) proteins (the E3 proteins are dispensible for 

15 growth in culture). Cell lines coexpressing El. polymerase and pTP are provided herein. 

Cell lines expressing El and E4 proteins have been recently described [Krougliak and Graham 
(1995) Hum. Gene Ther. 6:1575 and Wang et ai (1995) Gene Ther. 2:775] and cell lines 
expressing El and E2a proteins have been recently described [Zhou et ai (1996) J. Virol. 
70:7030). Therefore, a cell line co-expressing EI, E2a. E2b. and E4 is consructed by 

20 introduction of expression plasmids containing the E2a and E4 coding regions into the E1-. 

Ad polymerase- and pTP-expressing cell lines of the present invention. These packaging cell 
lines are used in conjunction with helper viruses containing deletions in the EI. E2a. E2b and 
E4 regions. 

All publications and patents mentioned in the above specification are herein 
25 incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
30 the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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SEQUENCE LISTING 

(I) GENERAL INFORMATION: 

(i) APPLICANT: Chamberlain. Jeffrey S. 

Amalfitano, Andrea 
Hauser, Michael A . 
Kumar- Singh. Ra jendra 
Hartigan-O' Connor , Dennis J. 

(ii) TITLE OF INVENTION: IMPROVED ADENOVIRUS VECTORS 
(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Medlen & Carroll, LLP 

(B) STREET:. 220 Montgomery Street, Suite 2200 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY : United States Of America 

(F) ZIP: 94104 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version Si. 30 

(vi } CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: Ingolia, Diane E. 
<B) REGISTRATION NUMBER: 40,027 
(C) REFERENCE /DOCKET NUMBER • UM-02484 

(ix) TELECOMMUNICATION INFORMATION • 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: (415) 397-8338 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 60 

TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGGAAGTGT 120 

GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT GACGTTTTTG 180 

GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 2 40 

TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 3 00 

AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 3 60 

GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 4 20 

CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT TTATACCCGG 4 80 
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TGAGT *. GCTC 


AAG AGG CC AC 


i. CTTGAGTGC 


CAGCGAGTAG 


AGTTTTCTCC 


TCCGAGCCGC 


540 




C" ft f~"TT* ft n TV ft 
GGAL TGAAAA 


ft ft ft *T* ft 

i G AGACATAT 


TATCTGCCAC 


GGAGGTGTTA 


TTACCGAAGA 


600 


A a t ^ ^ C r~ ^ r^" 
AA 1 Gv_jv_CGv_L 


AG rCTTTTGG 


ACCAGCTGAT 


CGAAGAGGTA 


CTGGCTGATA 


ATCTTCCACC 


660 


I \_ w I r.'ov. Lrtl 




G i ALGG i rC_ A 


CGmACTGTAT 


GATTTAGACG 


TGACGGCCCC 


720 


pp 2i a.p. itpt 




GGG III GGGA 


Z - * TV *f **T"T*' 1*' pi 

GATTTTTCCC 


GACTCTGTAA 


TGTTGGCGGT 


780 


p, r anr; a a r.nn 


Zi T TP 21 PTT a P 


TP a PTTTTrr 

i LAL 111 IGG 


GCGGGLGCCC 


GGTTCTCCGG 


AGCCGCCTCA 


840 


III ^.uLuy 


V- AVj (.L L. o Avj V_ 


AGGLGGAGCA 


GAGAGCCTTG 


GGTCCGGTTT 


CTATGCCAAA 


900 




P S.P.P.TP. A.TPP 


HILl l ALL 1 G 


<^^ft z™ 1 /^ RprrT 
GGAGGAGGCT 


GGCTTTCCAC 


CCAGTGACGA 


960 


pp a p p. l tp a a 




AG 1 1 iulbl I 


ft ^ ft TT ft TP T'/"* 

AGA TTA TGTG 


GAGCACCCCG 


GGCACGGTTG 


1020 




p A ttr t*p nrr 
CA i i Ai CALL 


GGAGGAATAC 


GGGGGACCCA 


GATATTATGT 


GTTCGCTTTG 


X0B0 


v_ 1 A 1 A i Li AGLr 


ACLTG LGGC A 


TGTTTGTCTA 


y-^ TV T\ TA O ^^S*^ TV 

CAGTAAGTGA 


AAATTATGGG 


CAGTGGGTGA 


1140 


1 AG AG Ibb iG 


GG I i 1GG 1 G I 


/-^ O r T 1 ft ft IT* T 1 'foil 

GGTAATTTTT 


TTTTTAATTT 


TTACAGTTTT 


GTGGTTTAAA 


1200 


/"* ft ft TT" ' " T '/"""P ft 

GAATT . TGTA 


TTGTGATTTT 


' MM "¥*1V TV TV 7\ y-^ ^T^ 

TTTAAAAGGT 


CCTGTGTCTG 


AACCTGAGCC 


TGAGCCCGAG 


1260 


C C AOmj-iv. 0 


AGGv_ iGCAAG 


ACCTACCCGC 


CGTCCTAAAA 


TGGCGCCTGC 


TATCCTGAGA 


1320 


CGCCv- G ACAT 


CACCTGTGTC 


^T^ 7A TV TV T\ rf^y^ 

TAGAGAATGC 


AATAGTAGTA 


CGGATAGCTG 


TGACTCCGGT 


1380 


CCTTv_ T /-AC A 


CACCTCCTGA 


GATACACCCG 


GTGGTCCCGC 


TGTGCCCCAT 


TAAACCAGTT 


1440 


GCCGTGAGAG 


TTGGTGGGCG 


TCGCCAGGCT 


GTGGAATGTA 


TCGAGGACTT 


GCTTAACGAG 


1500 


CCTGGuCAAC 


CTTTGGACTT 


GAGCTGTAAA 


CGCCCCAGGC 


CATAAGGTGT 


AAACCTGTGA 


1560 


TTGCGTGTGT 


GGTTAACGCC 


TTTGTTTGCT 


GAATGAGTTG 


ATGTAAGTTT 


AATAAAGGGT 


1620 


GAGATAATGT 


TTAACTTGCA 


TGGCGTGTTA 


AATGGGGCGG 


GGCTTAAAGG 


GTATATAATG 


1680 


CGCCGTGGG\_ 


TAATCTTGGT 


TACATCTGAC 


CTCATGGAGG 


CTTGGGAGTG 


TTTGGAAGAT 


1740 


TTTTCTGCTG 


I GCGTAACTT 


GCTGGAACAG 


AGCTCTAACA 


GTACCTCTTG 


GTTTTGGAGG 


1800 


TTTC * GTGoG 


GCTCATCCCA 


TV TV TV ^^^^ TA 

GGCAAAGTTA 


GTCTGCAGAA 


TTAAGGAGGA 


TTACAAGTGG 


1860 


vjAATT t G AAG 


AGCTTTTGAA 


ATCCTGTGGT 


GAGCTGTTTG 


ATTCTTTGAA 


TCTGGGTCAC 


1920 


CAGGCGCT iT 


TCCAAGAGAA 


GGTCATCAAG 


ACTTTGGATT 


TTTCCACACC 


GGGGCGCGCT 


1980 


GCGGCTGCTG 


TTGCTTTTTT 


GAGTTTTATA 


AAGGATAAAT 


GGAGCGAAGA 


AACCCATCTG 


2040 


AGCGGGGGGT 


ACCTGCTGGA 


TTTTCTGGCC 


ATGCATCTGT 


GGAGAGCGGT 


TGTGAGACAC 


2100 


AAG AATCG C C 


TG CT ACTG TT 


GTCTTCCGTC 


CGCCCGGCGA 


TAATACCGAC 


GGAGGAGCAG 


2160 


CAGCAGCAGC 


AGGAGGAAGC 


CAGGCGGCGG 


CGGCAGGAGC 


AGAGCCCATG 


GAACCCGAGA 


2220 


GCCGGCC J.GG 


ACCCTCGGGA 


TV T*/"* TV TV rv^^^r^f^^V 

ATGAATGTTG 


TACAGGTGGC 


TGAACTGTAT 


CCAGAACTGA 


2280 


GACGCATTTT 


TV IV IV ^^^^ fv 

GACAATTACA 


GAGGATGGGC 


AGGGGCTAAA 


GGGGGTAAAG 


AGGGAGCGGG 


2340 


GGGCTTGTGA 


GGCTACAGAG 


GAGGCTAGGA 


ATCTAGCTTT 


TAGCTTAATG 


ACCAGACACC 


2400 


GTCCTGAGTG 


TATTACTTTT 


CAACAGATCA 


AGGATAATTG 


CGCTAATGAG 


CTTGATCTGC 


2460 


TGGCGCAGAA 


GTATTCCATA 


GAGCAGCTGA 


CCACTTACTG 


GCTGCAGCCA 


GGGGATGATT 


2520 


TTGAGGAGGC 


TATTAGGGTA 


TATGCAAAGG 


TGGCACTTAG 


GCCAGATTGC 


AAGTACAAGA 


2580 
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2640 
2700 
2760 
2920 
2880 



TCAGCAAACT TGTAAATATC AGGAATTGTT GCTACATTTC TGGGAACGGG GCCGAGGTGG 
AGATAGATAC GGAGGATAGG GTGGCCTTTA GATGTAGCAT GATAAATATG TGGCCGGGGG 
TGCTTGGCAT GGACGGGGTG GTTATTATGA ATGTAAGGTT TACTGGCCCC AATTTTAGCG 
GTACGGTT7T CCTGGCCAAT ACCAACCTTA TCCTACACGG TGTAAGCTTC TATGGGTTTA 
ACAATACCTG TGTGGAAGCC TGGACCGATG TAAGGGTTCG GGGCTGTGCC TTTTACTGCT 

GCTGGAAGGG GGTGGTGTGT CGCCCCAAAA GCAGGGCTTC AATTAAGAAA TGCCTCTTTG 2940 

AAAGGTGTAC CTTGGGTATC CTGTCTGAGG GTAACTCCAG GGTGCGCCAC AATG TGGCCT 3000 

CCGACTGTGG TTGCTTCATG CTAGTGAAAA GCGTGGCTGT GATTAAGCAT AACATGGTAT 3060 

GTGGCAACTG CGAGGACAGG GCCTCTCAGA TGCTGACCTG CTCGGACGGC AACTGTCACC 3120 

TGCTGAAGAC CATTCACGTA GCCAGCCACT CTCGCAAGGC CTGGCCAGTG TTTGAGCATA 3i 80 

ACATACTGAC CCGCTGTTCC TTGCATTTGG GTAACAGGAG GGGGGTGTTC CTACCTTACC 3240 

AATGCAATTT GAGTCACACT AAGATATTGC TTGAGCCCGA GAGCATGTCC AAGGTGAACC 3300 

TGAACGGGGT GTTTGACATG ACCATGAAGA TCTGGAAGGT GCTGAGGTAC GATGAGACCC 3 360 

GCACCAGGTG CAGACCCTGC GAGTGTGGCG GTAAACATAT TAGGAACCAG CCTGTGATGC 3420 

TGGATGTGAC CGAGGAGCTG AGGCCCGATC ACTTGGTGCT GGCCTGCACC CGCGCTGAGT 34 80 

TTGGCTCTAG CGATGAAGAT ACAGATTGAG GTACTGAAAT GTGTGGGCGT GGCTTAAGGG 3540 

TGGGAAAGAA TATATAAGGT GGGGGTCTTA TGTAGTTTTG TATCTGTTTT GCAGCAGCCG 3600 

CCGCCGCCAT GAGCACCAAC TCGTTTGATG GAAGCATTGT GAGCTCATAT TTGACAACGC 3660 

GCATGCCCCC ATGGGCCGGG GTGCGTCAGA ATGTGATGGG CTCCAGCATT GATGGTCGCC 3 720 

CCGTCCTGCC CGCAAACTCT ACTACCTTGA CCTACGAGAC CGTGTCTGGA ACGCCGTTGG 3780 

AGACTGCAGC CTCCGCCGCC GCTTCAGCCG CTGCAGCCAC CGCCCGCGGG ATTGTGACTG 3840 

ACTTTGCTTT CCTGAGCCCG CTTGCAAGCA GTGCAGCTTC CCGTTCATCC GCCCGCGATG 3 900 

ACAAGTTGAC GGCTCTTTTG GCACAATTGG ATTCTTTGAC CCGGGAACTT AATGTCGTTT 3 96 0 

CTCAGCAGCT GTTGGATCTG CGCCAGCAGG TTTCTGCCCT GAAGGCTTCC TCCCCTCCCA 4 020 

ATGCGGTTTA AAACATAAAT AAAAAACCAG ACTCTGTTTG GATTTGGATC AAGCAAGTGT 4 080 

CTTGCTGTCT TTATTTAGGG GTTTTGCGCG CGCGGTAGGC CCGGGACCAG CGGTCTCGGT 4140 

CGTTGAGGGT CCTGTGTATT TTTTCCAGGA CGTGGTAAAG GTGACTCTGG ATGTTCAGAT 4200 

ACATGGGCAT AAGCCCGTCT CTGGGGTGGA GGTAGCACCA CTGCAGAGCT TCATGCTGCG 4260 

GGGTGGTGTT GTAGATGATC CAGTCGTAGC AGGAGCGCTG GGCGTGGTGC CTAAAAATGT 4320 

CTTTCAGTAG CAAGCTGATT GCCAGGGGCA GGCCCTTGGT GTAAGTGTTT ACAAAGCGGT 4 380 

TAAGCTGGGA TGGGTGCATA CGTGGGGATA TGAGATGCAT CTTGGACTGT ATTTTTAGGT 4440 

TGGCTATGTT CCCAGCCATA TCCCTCCGGG GATTCATGTT GTGCAGAACC ACCAGCACAG 4500 

TGTATCCGGT GCACTTGGGA AATTTGTCAT GTAGCTTAGA AGGAAATGCG TGGAAGAACT 4 560 

TGGAGACGCC CTTGTGACCT CCAAGATTTT CCATGCATTC GTCCATAATG ATGGCAATGG 4620 

GCCCACGGGC GGCGGCCTGG GCGAAGATAT TTCTGGGATC ACTAACGTCA TAGTTGTGTT 4680 
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CCAGGATGAG 

GTATAATGGT 
CTTTGAGTTC 
GGGTAGGGGA 
CGGTGGGCCC 
TGCCGTCATC 
CCCTGACCAA 
CAAAGTTTTT 
GCAGTTCCAG 
CTCCTCGTTT 
ACGGGCCAGG 
GGTGAAGGGG 
GGTGCTGAAG 
GTCATAGTCC 
GCCGCACGAG 
TTCCGGGGAG 
GGTGAGCTCT 
CTTACCTCTG 
CCCGTATACA 
AAACTCGGAC 
GGAGGGGTAG 
GTCGCCCTCT 
TGTTCCTGAA 
ATCGCTGTCT 
TTCTGCGCTA 
GGTGATGCCT 
AAGCTTGGTG 
GGTTTGGTTT 
GCGCGCAACG 
GCGCCAACCG 
GCGCTCGTTG 
TAGCTGCGTC 
GTCGAAGTAG 
AAGCGCGCGC 
GGCGTACATG 



ATCGTCATAG 
TCCATCCGGC 
AGATGGGGGG 
GATCAGCTGG 
GTAAATCACA 
CCTGAGCAGG 
ATCCGCCAGA 
CAACGGTTTG 
GCGGTCCCAC 
CGCGGGTTGG 
GTCATGTCTT 
TGCGCTCCGG 
CGCTGCCGGT 
AGCCCCTCCG 
GGGCAGTGCA 
TAGGCATCCG 
GGCCGTTCGG 
GTTTCCATGA 
GACTTGAGAG 
CACTCTGAGA 
CGGTCGTTGT 
TCGGCATCAA 
GGGGGGCTAT 
GCGAGGGCCA 
AGATTGTCAG 
TTGAGGGTGG 
GCAAACGACC 
TTGTCGCGAT 
CACCGCCATT 
CGGTTGTGCA 
GTCCAGCAGA 
TCGTCCGGGG 
TCTATCTTGC 
TCGTATGGGT 
CCGCAAATGT 



GCCATTTTTA 
CCAGGGGCGT 
ATCATGTCTA 
GAAGAAAGCA 
CCTATTACCG 
GGGGCCACTT 
AGGCGCTCGC 
AGACCGTCCG 
AGCTCGGTCA 
GGCGGCTTTC 
TCCACGGGCG 
GCTGCGCGCT 
CTTCGCCCTG 
CGGCGTGGCC 
GACTTTTGAG 
CGCCGCAGGC 
GGTCAAAAAC 
GCCGGTGTCC 
GCCTGTCCTC 
CAAAGGCTCG 
CCACTAGGGG 
GGAAGGTGAT 
AAAAGGGGGT 
GCTGTTGGGG 
TTTCCAAAAA 
CCGCATCCAT 
CGTAGAGGGC 
CGGCGCGCTC 
CGGGAAAGAC 
GGGTGACAAG 
GGCGGCCGCC 
GGTCTGCGTC 
ATCCTTGCAA 
TGAGTGGGGG 
CGTAAACGTA 



CAAAGCGCGG 
AGTTACCCTC 
CC7GCGGGGC 
GGTTCCTGAG 
GGTGCAACTG 
CGTTAAGCAT 
CGCCCAGCGA 
CCGTAGGCAT 
CCTGCTCTAC 
GCTGTACGGC 
CAGGGTCCTC 
GGCCAGGGTG 
CGCGTCGGCC 
CTTGGCGCGC 
GGCGTAGAGC 
CCCGCAGACG 
CAGGTTTCCC 
ACGCTCGGTG 
GAGCGGTGTT 
CGTCCAGGCC 
GTCCACTCGC 
TGGTTTGTAG 
GGGGGCGCGT 
TGAGTACTCC 
CGAGGAGGAT 
CTGGTCAGAA 
GTTGGACAGC 
CTTGGCCGCG 
GGTGGTGCGC 
GTCAACGCTG 
CTTGCGCGAG 
CACGGTAAAG 
GTCTAGCGCC 
ACCCCATGGC 
GAGGGGCTCT 



GCGGAGGGTG 
ACAGATTTGC 
G ATG AAG AAA 
CAGCTGCGAC 
GTAGTTAAGA 
GTCCCTGACT 
TAGCAGTTCT 
GCTTTTGAGC 
GGCATCTCGA 
AGTAGTCGGT 
GTCAGCGTAG 
CGCTTGAGGC 
AGGTAGCATT 
AGCTTGCCGT 
TTGGGCGCGA 
GTCTCGCATT 
CCATGCTTTT 
ACGAAAAGGC 
CCGCGGTCCT 
AGCACGAAGG 
TCCAGGGTGT 
GTGTAGGCCA 
TCGTCCTCAC 
CTCTGAAAAG 
TTGATATTCA 
AAGACAATGT 
AACTTGGCGA 
ATGTTTAGCT 
TCGTCGGGCA 
GTGGCTACCT 
CAGAATGGCG 
ACCGCGGGCA 
TGCTGCCATG 
ATGGGGTGGG 
CTGAGTATTC 
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CGCGGGCGGC 


6660 


TGAGCGCGGA 


6720 


CAAGATATGT 


6780 
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* CGTGCGAGGG 


6 34 0 


i AGACTATCTG 


6 900 


' TGAAGCTGGC 


6 960 


* GCTTGTTGAC 


702 0 


1 TGATGATGTC 


70ft o 


CTTCGCGGTC 




CTAGCATGTA 


7200 


CGTATGCCTG 


7260 


TGACTTTGAG 


7320 


AAAAGTCCGT 


7380 


GTATCTTTCC 


7440 


AACGGTTGTT 


7500 


CCACAATGTA 


7560 


CCTCGTAGGT 


7620 


GATGAGGGTT 


768 0 


GGTCGCGAAA 


7 74 0 


AGGTAAGCGG 


7 8 00 


CAGTCACTAG 




TCCCAAAGGC 


7 q t n 


TGCGAGGATG 


798 0 


TATTGATGTG 


8 04 0 


AAAAACGTGC 


8100 


GACGACCGCG 


8160 


GGTGGTCTTC 


8220 


TGGATCGGAC 


8280 


GCTTGATGAC 


8340 


GGTCAGGCGG 


8400 


CCAGGTGATA 


8460 


CGCATCCCCG 


8520 


TGGATGATGC 


8580 


ACCCGCCGGG 


8640 


CGCGCGTAGG 


8 700 


CTGCGTGAAG 


8760 


TTCGGTGTCG 


8820 


ATAGGCGATC 


8880 
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TCGGCCATGA 


ACTGCTCGAT 


CTCTTCCTCC 


TGGAGATCTC 


CGCGTCCGGC 


TrnrT^rarr 


3*94 0 


GTGGCGGCGA 


GGTCGTTGGA 


AATGCGGGCC 


ATGAGCTGCG 


AGAAGGCGTT 


VJrtVJO^V- 1 \_ v_ ^ 


9000 


TCGTTCCAGA 


CGCGGCTGTA 


GACCACGCCC 


CCTTCGGCAT 


CGCGGGCGCG 


CATG A CC* A (T 


yobu 


TGCGCGAGAT 


TGAGCTCCAC 


GTGCCGGGCG 


AAGACGGCGT 


AGTTTCGCAG 


GCGCTG A A ar; 




AGGTAGTTGA 


GGGTGGTGGC 


GGTGTGTTCT 


GCCACGAAGA 


AGTACATAAC 


CCAGCGTPGP 


Q i Q n 


AACGTGGATT 


CGTTGATATC 


CCCCAAGGCC 


TCAAGGCGCT 


CCATGGCCTC 


G TAG AAG TCC 




ACGGCGAAGT 


TGAAAAACTG 


GGAGTTGCGC 


GCCGAGACGG 


TTAACTCCTC 


CTCCAGAAGA 


q i nn 


CGGATGAGCT 


CGGCGACAGT 


GTCGCGCACC 


TCGCGGTCAA 


AGGCTACAGG 


GGCCTCTTPT 


? JbU 


TCTTCTTCAA 


TCTCCTCTTG 


CATAAGGGCC 


TCCCCTTCTT 


CTTCTTCTGG 


CGGCGGTGGG 




GGAGGGGGGA 


CACGGCGGCG 


ACGACGGCGC 


ACCGGGAGGC 


GGTCGACAAA 


GGGCTCGATP 


7 *t O u 


ATCTCCCCGC 


GGCGACGGCG 


CATGGTCTCG 


GTGACGGCGC 


GGCCGTTCTC 


GCGGGGGPGP 




AGTTGGAAGA 


CGCCGCCCGT 


CATGTCCCGG 


TTATGGGTTG 


GCGGGGGGCT 


GCCATGCGGP 


7bUU 


AGGGATACGG 


CGCTAACGAT 


GCATCTCAAC 


AATTGTTGTG 


TAGGTACTCC 


GCCGCCGAGG 


a c cl n 


GACCTGAGCG 


AGTCCGCATC 


GACCGGATCG 


GAAAACCTCT 


CG AG AAAGGC 


GTCTAACCAG 


Q 7*5 n 


TCACAGTCGC 


AAGGTAGGCT 


GAGCACCGTG 


GCGGGCGGCA 


GCGGGCGGCG 


GTCGGGGTTG 


Q 7 O ft 


TTTCTGGCGG 


AGGTGCTGCT 


GATGATGTAA 


TTAAAGTAGG 


CGGTCTTGAG 


ACGGCGGATG 




GTCGACAGAA 


GCACCATGTC 


CTTGGGTCCG 


GCCTGCTGAA 


TGGGCAGGCG 


GTCGGCPATP, 


QQArt 
77UU 


CCCCAGGCTT 


CGTTTTGACA 


TCGGCGCAGG 


TCTTTGTAGT 


AGTCTTGCAT 


GAGCCTTTPT 


730U 


ACCGGCACTT 


CTTCTTCTCC 


TTCCTCTTGT 


CCTGCATCTC 


TTGCATCTAT 


CGCTGCGGPP, 


lvUt u 


GCGGCGGAGT 


TTGGCCGTAG 


GTGGCGCCCT 


CTTCCTCCCA 


TGCGTGTGAC 


CCCGAAGPPC 


10080 


CTCATCGGCT 


GAAGCAGGGC 


TAGGTCGGCG 


ACAACGCGCT 


CGGCTAATAT 


GGCCTGPTGP 




ACCTGCGTGA 


GGGTAGACTG 


GAAGTCATCC 


ATGTCCACAA 


AGCGGTGGTA 




-L y £ U U 


TTGATGGTGT 


AAGTGCAGTT 


GGCCATAACG 


GACCAGTTAA 


CGGTCTGGTG 


ACCCGGCTGP 


102 6 0 


GAGAGCTCGG 


TGTACCTGAG 


ACGCGAGTAA 


GCCCTCGAGT 


CAAATACGTA 


GTCGTTGPAA 


103 20 


GTCCGCACCA 


GGTACTGGTA 


TCCCACCAAA 


AAGTGCGGCG 


GCGGCTGGCG 


GTAGAGGGGP 


IvJOU 


CAGCGTAGGG 


TGGCCGGGGC 


TCCGGGGGCG 


AGATCTTCCA 


ACATAAGGCG 


ATGATATPPG 


1044 0 


TAGATGTACC 


TGGACATCCA 


GGTGATGCCG 


GCGGCGGTGG 


TGGAGGCGCG 


CGGAAAGTCG 


10500 


CGGACGCGGT 


TCCAGATGTT 


GCGCAGCGGC 


AAAAAGTGCT 


CCATGGTCGG 


GACGCTCTGG 


10560 


CCGGTCAGGC 


GCGCGCAATC 


GTTGACGGTC 


TAGACCGTGC 


AAAAGGAGAG 


CCTGTAAGCG 


10620 


GGCACTCTTC 


CGTGGTCTGG 


TGGATAAATT 


CGCAAGGGTA 


TCATGGCGGA 


CGACCGGGGT 


106 80 


TCGAGCCCCG 


TATCCGGCCG 


TCCGCCGTGA 


TCCATGCGGT 


TACCGCCGGG 


GTGTCGAACC 


10740 


CAGGTGTGCG 


ACGTCAGACA 


ACGGGGGAGT 


GCTCCTTTTG 


GCTTCCTTCC 


AGGCGCGGCG 


10800 


GCTGCTGCGC 




Li<jC\_AL iGGC 


CGCGCGCAGC 


GTAAGCGGTT 


AGGCTGGAAA 


10860 


GCGAAAGCAT 


TAAGTGGCTC 


GCTCCCTGTA 


GCCGGAGGGT 


TATTTTCCAA 


GGGTTGAGTC 


10920 


GCGGGACCCC 


CGGTTCGAGT 


CTCGGACCGG 


CCGGACTGCG 


GCGAACGGGG 


GTTTGCCTCC 


10980 
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CCCTCATGCA AGACCCCGCT TGCAAATTCC TCCGGAAACA GGGACGAGCC CCTTTTTTGC 11040 

TTTTCCCAGA TGCATCCGGT GCTGCGGCAG ATGCGCCCCC CTCCTCAGCA GCGGCAAGAG ll 100 

CAAGAGCAGC GGCAGACATG CAGGGCACCC TCCCCTCCTC CTACCGCGTC AGGAGGGGCG UISO 

ACATCCGCGG TTGACGCGGC AGCAGATGGT GATTACGAAC CCCCGCGGCG CCGGGCCCGG l 1220 

CACTACCTGG ACTTGGAGGA GGGCGAGGGC CTGGCGCGGC TAGGAGCGCC CTCTCCTGAG U280 
. CGGTACCCAA GGGTGCAGCT GAAGCGTGAT ACGCGTGAGG CGTACGTGCC GCGGCAGAAC 
CTGTTTCGCG ACCGCGAGGG AGAGGAGCCC GAGGAGATGC GGGATCGAAA GTTCCACGCA 
GGGCGCGAGC TGCGGCATGG CCTGAATCGC GAGCGGTTGC TGCGCGAGGA GGACTTTGAG 

CCCGACGCGC GAACCGGGAT TAGTCCCGCG CGCGCACACG TGGCGGCCGC CGACCTGGTA U520 

ACCGCATACG AGCAGACGGT GAACCAGGAG ATTAACTTTC AAAAAAGCTT TAACAACCAC U580 

. GTGCGTACGC TTGTGGCGCG CGAGGAGGTG GCTATAGGAC TGATGCATCT GTGGGACTTT u 640 
. GTAAGCGCGC TGGAGCAAAA CCCAAATAGC AAGCCGCTCA TGGCGCAGCT GTTCCTTATA 
GTGCAGCACA GCAGGGACAA CGAGGCATTC AGGGATGCGC TGCTAAACAT AGTAGAGCCC 
GAGGGCCGCT GGCTGCTCGA TTTGATAAAC ATCCTGCAGA GCATAGTGGT GCAGGAGCGC 
AGCTTGAGCC TGGCTGACAA GGTGGCCGCC ATCAACTATT CCATGCTTAG CCTGGGCAAG 

TTTTACGCCC GCAAGATATA CCATACCCCT TACGTTCCCA TAGACAAGGA GGTAAAGATC 11940 

GAGGGGTTCT ACATGCGCAT GGCGCTGAAG GTGCTTACCT TGAGCGACGA CCTGGGCGTT 12000 

TATCGCAACG AGCGCATCCA CAAGGCCGTG AGCGTGAGCC GGCGGCGCGA GCTCAGCGAC 12060 

CGCGAGCTGA TGCACAGCCT GCAAAGGGCC CTGGCTGGCA CGGGCAGCGG CGATAGAGAG 12120 

GCCGAGTCCT ACTTTGACGC GGGCGCTGAC CTGCGCTGGG CCCCAAGCCG ACGCGCCCTG 12180 

GAGGCAGCTG GGGCCGGACC TGGGCTGGCG GTGGCACCCG CGCGCGCTGG CAACGTCGGC 12240 

GGCGTGGAGG AATATGACGA GGACGATGAG TACGAGCCAG AGGACGGCGA GTACTAAGCG 12300 

GTGATGTTTC TGATCAGATG ATGCAAGACG CAACGGACCC GGCGGTGCGG GCGGCGCTGC . 12360 

AGAGCCAGCC GTCCGGCCTT AACTCCACGG ACGACTGGCG CCAGGTCATG GACCGCATCA 12420 

, TGTCGCTGAC TGCGCGCAAT CCTGACGCGT TCCGGCAGCA GCCGCAGGCC AACCGGCTCT 12480 

. CCGCAATTCT GGAAGCGGTG GTCCCGGCGC GCGCAAACCC CACGCACGAG AAGGTGCTGG 12540 

CGATCGTAAA CGCGCTGGCC GAAAACAGGG CCATCCGGCC CGACGAGGCC GGCCTGGTCT 12600 

ACGACGCGCT GCTTCAGCGC GTGGCTCGTT ACAACAGCGG CAACGTGCAG ACCAACCTGG 12660 

. ACCGGCTGGT GGGGGATGTG CGCGAGGCCG TGGCGCAGGG TGAGCGCGCG CAGCAGCAGG 12720 

GCAACCTGGG CTCCATGGTT GCACTAAACG CCTTCCTGAG TACACAGCCC GCCAACGTGC 12780 

. CGCGGGGACA GGAGGACTAC ACCAACTTTG TGAGCGCACT GCGGCTAATG GTGACTGAGA 12 840 

CACCGCAAAG TGAGGTGTAC CAGTCTGGGC CAGACTATTT TTTCCAGACC AGTAGACAAG 12 900 

GCCTGCAGAC CGTAAACCTG AGCCAGGCTT TCAAAAACTT GCAGGGGCTG TGGGGGGTGC 12 960 

GGGCTCCCAC AGGCGACCGC GCGACCGTGT CTAGCTTGCT GACGCCCAAC TCGCGCCTGT 13020 

TGCTGCTGCT AATAGCGCCC TTCACGGACA GTGGCAGCGT GTCCCGGGAC ACATACCTAG 13080 
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GTCACT7GCT 


GACACTGTAC 


CGCGAGGCCA 


TAGGTCAGGC 


GCATGTGGAC 


GAGCATACTT 


1314 0 


TCCAGGAGAT 


TACAAGTGTC 


AGCCGCGCGC 


TGGGGCAGGA 


GGACACGGGC 


AGCCTGGAGG 


13 200 


CAACCCTAAA 


CTACCTGCTG 


ACCAACCGGC 


GGCAGAAGAT 


CCCCTCGTTG 


CACAGTTTAA 


13260 


ACAGCGAGGA 


GGAGCGCATT 


TTGCGCTACG 


TGCAGCAGAG 


CGTGAGCC7T 


AACCTGATGC 


13320 


GCGACGGGGT 


AACGCCCAGC 


GTGGCGCTGG 


ACATGACCGC 


GCGCAACATG 


GAACCGGGCA 


13380 


TGTATGCCTC 


AAACCGGCCG 


TTTATCAACC 


GCCTAATGGA 


CTACTTGCAT 


CGCGCGGCCG 


1344 0 


CCGTGAACCC 


CGAGTATTTC 


ACCAATGCCA 


TCTTGAACCC 


GCACTGGCTA 


CCGCCCCCTG 


1 3 500 


GTTTCTACAC 


CGGGGGATTC 


GAGGTGCCCG 


AGGGTAACGA 


TGGATTCCTC 


TGGGACGACA 


13 560 


TAGACGACAG 


CGTGTTTTCC 


CCGCAACCGC 


AGACCCTGCT 


AGAGTTGCAA 


CAGCGCGAGC 


13620 


AGGCAGAGGC 


GGCGCTGCGA 


AAGGAAAGCT 


TCCGCAGGCC 


AAGCAGCTTG 


TCCGATCTAG 


13680 


GCGCTGCGGC 


CCCGCGGTCA 


GATGCTAGTA 


GCCCATTTCC 


AAGCTTGATA 


GGGTCTCTTA 


13 74 0 


CCAGCACTCG 


CACCACCCGC 


CCGCGCCTGC 


TGGGCGAGGA 


GGAGTACCTA 


AACAACTCGC 


13 800 


TGCTGCAGCC 


GCAGCGCGAA 


AAAAACCTGC 


CTCCGGCATT 


TCCCAACAAC 


GGGATAGAGA 


13860 


GCCTAGTGGA 


CAAGATGAGT 


AGATGGAAGA 


CGTACGCGCA 


GGAGCACAGG 


GACGTGCCAG 


13920 


GCCCGCGCCC 


GCCCACCCGT 


CGTCAAAGGC 


ACGACCGTCA 


GCGGGGTCTG 


GTGTGGGAGG 


13980 


ACGATGACTC 


GGCAGACGAC 


AGCAGCGTCC 


TGGATTTGGG 


AGGGAGTGGC 


AACCCGTTTG 


14040 


CGCACCTTCG 


CCCCAGGCTG 


GGGAGAATGT 


TTTAAAAAAA 


AAAAAGCATG 


ATGCAAAATA 


14100 


AAAAACTCAC 


CAAGGCCATG 


GCACCGAGCG 


TTGGTTTTCT 


TGTATTCCCC 


TTAGTATGCG 


1416 0 


GCGCGCGGCG 


ATGTATGAGG 


AAGGTCCTCC 


TCCCTCCTAC 


GAGAGTG7GG 


TGAGCGCGGC 


14220 


GCCAGTGGCG 


GCGGCGCTGG 


GTTCTCCCTT 


CGATGCTCCC 


CTGGACCCGC 


CGTTTGTGCC 


14280 


TCCGCGGTAC 


CTGCGGCCTA 


CCGGGGGGAG 


AAACAGCATC 


CGTTACTCTG 


AGTTGGCACC 


14340 


CCTATTCGAC 


ACCACCCGTG 


TGTACGTGGT 


GGACAACAAG 


TCAACGGATG 


TGGCATCCCT 


14400 


GAACTACCAG 


AACGACCACA 


GCAACTTTCT 


GACCACGGTC 


ATTCAAAACA 


ATGACTACAG 


14460 


CCCGGGGGAG 


GCAAGCACAC 


AGACCATCAA 


TCTTGACGAC 


CGGTCGCACT 


GGGGCGGCGA 


14520 


CCTGAAAACC 


ATCCTGCATA 


CCAACATGCC 


AAATGTGAAC 


GAGTTCATGT 


TTACCAATAA 


14580 


GTTTAAGGCG 


CGGGTGATGG 


TGTCGCGCTT 


GCCTACTAAG 


GACAATCAGG 


TGGAGCTGAA 


14640 


ATACGAGTGG 


GTGGAGTTCA 


CGCTGCCCGA 


GGGCAACTAC 


TCCGAGACCA 


TGACCATAGA 


14700 


CCTTATGAAC 


AACGCGATCG 


TGGAGCACTA 


CTTGAAAGTG 


GGCAGACAGA 


ACGGGGTTCT 


14760 


GGAAAGCGAC 


ATCGGGGTAA 


AGTTTGACAC 


CCGCAACTTC 


AGACTGGGGT 


TTGACCCCGT 


14820 


CACTGGTCTT 


GTCATGCCTG 


GGGTATATAC 


AAACGAAGCC 


TTCCATCCAG 


ACATCATTTT 


14880 


GCTGCCAGGA 


TGCGGGGTGG 


ACTTCACCCA 


CAGCCGCCTG 


AGCAACTTGT 


TGGGCATCCG 


14940 


CAAGCGGCAA 


CCCTTCCAGG 


AGGGCTTTAG 


GATCACCTAC 


GATGATCTGG 


AGGGTGGTAA 


15000 


CATTCCCGCA 


CTGTTGGATG 


TGGACGCCTA 


CCAGGCGAGC 


TTGAAAGATG 


ACACCGAACA 


15060 


GGGCGGGGGT 


GGCGCAGGCG 


GCAGCAACAG 


CAGTGGCAGC 


GGCGCGGAAG 


AGAACTCCAA 


15120 


CGCGGCAGCC 


GCGGCAATGC 


AGCCGGTGGA 


GGACATGAAC 


GATCATGCCA 


TTCGCGGCGA 


15180 
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CACC777GCC 


/-"iCACGGGCTG 


: AGGAGAAGCG 


CGCTGAGGCC 


GAAGCAGCGG 


: CCGAAGCTGC 


15240 


CGCCCCC jCT 


GCGC.AACCCG 


AGGTCGAGAA 


GCCTCAGAAG 


AAACCGGTGA 


. TCAAACCCCT 


15300 


GACAGAGGAC 


AGCAAGAAAC 


GCAGTTACAA 


CCTAATAAGC 


AATGACAGCA 


. CCTTCACCCA 


15360 


GTACCGCAGC 


TGGTACCTTG 


CATACAACTA 


CGGCGACCCT 


CAGACCGGAA 


TCCGCTCATG 


15420 


GACCCTGCTT 


TGCACTCCTG 


ACGTAACCTG 


CGGCTCGGAG 


CAGGTCTACT 


GGTCGTTGCC 


15480 


AGACATGATG 


C.AAGACCCCG 


TGACCTTCCG 


CTCCACGCGC 


CAGATCAGCA 


ACTTTCCGGT 


15540 


GGTGGGCGCC 


GAGCTGTTGC 


CCGTGCACTC 


CAAGAGGTTG 


TACAACGACC 


AGGCCGTCTA 


15600 


CTCCCAACTC 


ATCCGCCAGT 


TTACGTCTCT 


GACCCACGTG 


TTCAATCGCT 


TTCCCGAGAA 


15660 


CCAGATTTTG 


GCGCGCCCGC 


CAGCGGCGAC 


CATCACCAGC 


GTGAGTGAAA 


ACGTTCCTGG 


15720 


TCTCACAGAT 


CACGGGACGC 


TACCGCTGCG 


CAACAGCATC 


GGAGGAGTGC 


AGCGAGTGAC 


15780 


CATTACTGAC 


GCCAGACGCC 


GCACCTGCCC 


CTACGTTTAC 


AAGGCCCTGG 


GCATAGTCTC 


15840 


GCCGCGCGTC 


CTATCGAGCC 


GCACTTTTTG 


AGCAAGCATG 


TCCATCCTTA 


TATCGCCCAG 


15900 


CAATAACACA 


GGCTGGGGCC 


TGCGCTTCCC 


AAGCAAGATG 


TTTGGCGGGG 


CCAAGAAGCG 


15960 


CTCCGACC.AA 


CACCCAGTGC 


GCGTGCGGGG 


GGACTACCGC 


GGGCCCTGGG 


GCGCGCACAA 


16020 


ACGCGGCCGC 


ACTGGGCGCA 


CCACCGTCGA 


TGACGCGATC 


GACGCGGTGG 


TGGAGGAGGC 


16080 


GCGCAACTAC 


ACGCCCACGC 


CGCCACCAGT 


GTCCAGAGTG 


GACGCGGCCA 


TTCAGACCGT 


16140 


GGTGCGCGGA 


GCCCGGGGCT 


ATGCTAAAAT 


GAAGAGACGG 


GGGAGGCGCG 


TAGCACGTCG 


16200 


CCACCGCCGC 


CGACCGGGCA 


CTGGCGCCCA 


ACGCGCGGCG 


GCGGCGCTGC 


TTAACCGCGC 


16260 


ACGTCGCACC 


GGCCGACGGG 


CGGCCATGCG 


GGCCGCTCGA 


AGGCTGGCCG 


CGGGTATTGT 


16320 


CACTGTGCCC 


CGCAGGTCCA 


GGCGACGAGC 


GGCCGCCGCA 


GCAGCCGCGG 


CCATTAGTGC 


16380 


TATGACTCAG 


GGTCGCAGGG 


GCAACGTGTA 


TTGGGTGCGC 


GACTCGGTTA 


GCGGCCTGCG 


16440 


CGTGCCCGTG 


CGCACCCGGC 


CCCCGCGCAA 


CTAGATTGCA 


AGAAAAAACT 


ACTTAGACTC 


16500 


GTACTGTTGT 


ATGTATCGAG 


CGGCGGCGGC 


GCGCAACGAA 


GCTATGTCCA 


AGCGCAAAAT 


16560 


CAAAGAAGAG 


ATGCTCCAGG 


TCATCGCGCC 


GGAGATCTAT 


GGCCCGCCGA 


AGAAGGAAGA 


16620 


GCAGGATTAC 


AAGCCCCGAA 


AGCTAAAGCG 


GGTCAAAAAG 


AAAAAGAAAG 


ATGATGATGA . 


16680 


TGAAGTTGAC 


GACGAGGTGG 


AACTGCTGCA 


GGCTACCGCG 


CCCAGGCGAC 


GGGTACAGTG 


16740 


GAAAGGTCGA 


CGCGTAAAAC 


GTGTTTTGCG 


ACCCGGCACC 


ACCGTAGTCT 


TTACGCCCGG 


16800 


TGAGCGCTCC 


ACCCGCACCT 


ACAAGCGCGT 


GTATGATGAG 


GTGTACGGCG 


ACGAGGACCT 


16860 


GCTTGAGCAG 


GCCAACGAGC 


GCCTCGGGGA 


GTTTGGCTAC 


GGAAAGCGGG 


ATAAGGACAT 


16920 


GCTGGCGTTG 


CCGCTGGACG 


AGGGCAACCC 


AACACCTAGC 


CTAAAGCCCG 


TAACACTGCA 


16980 


GCAGGTGCTG 


CCCGCGCTTG 


CACCGTCCGA 


AGAAAAGCGC 


GG CCTAAAGC 


GCGAGTCTGG 


17040 


TGACTTGGCA 


CCCACCGTGC 


AGCTGATGGT 


ACCCAAGCGC 


CAGCGACTGG 


AAGATGTCTT 


17100 


GGAAAAAATG 


ACCGTGGAAC 


GTGGGCTGGA 


GCCCGAGGTC 


CGCGTGCGGC 


CAATCAAGCA 


17160 


GGTGGCGCCG 


GGACTGGGCG 


TGCAGACGGT 


GGACGTTCAG 


ATACCCACTA 


CCAGTAGCAC 


17220 


CAGTATTGCC 


ACCGCCACAG 


AGGGCATGGA 


GACACAAACG 


TCCCCGGTTG 


CCTCAGCGGT 


17280 
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GGCGGATGCC 


GCGGTGCAGG 


CGGTCGCTGC 


GGCCGCGTCC 


AAGACCTCTA 


CGGAGGTGCA 


17 340 


AACGGACCCG 


TGGATGTTTC 


GCGTTTCAGC 


CCCCCGGCGC 


CCGCGCGGTT 


CGAGGAAGTA 


174 00 


CGGCGCCGCC 


AGCGCGCTAC 


TGCCCGAATA 


TGCCCTACAT 


CCTTCCATTG 


CGCCTACCCC 


17460 


CGGCTATCGT 


GGCTACACCT 


ACCGCCCCAG 


AAGACGAGCA 


ACTACCCGAC 


GCCGAACCAC 


17520 


CACTGGAACC 


CGCCGCGGCC 


GTCGCCGTCG 


CCAGCCCGTG 


CTGGCCCCGA 


TTTCCGTGCG 


17 580 


CAGGGTGGCT 


CGCGAAGGAG 


GCAGGACCCT 


GGTGCTGCCA 


ACAGCGCGCT 


ACCACCCCAG 


1764 0 


CATCGTTTAA 


AAGCCGGTCT 


TTGTGGTTCT 


TGCAGATATG 


• 

GCCCTCACCT 


GCCGCCTCCG 


17700 


TTTCCCGGTG 


CCGGGATTCC 


GAGGAAGAAT 


GCACCGTAGG 


AGGGGCATGG 


CCGGCCACGG 


17760 


CCTGACGGGC 


GGCATGCGTC 


GTGCGCACCA 


CCGGCGGCGG 


CGCGCGTCGC 


ACCGTCGCAT 


17820 


GCGCGGCGGT 


ATCCTGCCCC 


TCCTTATTCC 


ACTGATCGCC 


GCGGCGATTG 


GCGCCGTGCC 


17880 


CGGAATTGCA 


TCCGTGGCCT 


TGCAGGCGCA 


GAGACACTGA 


TTAAAAACAA 


GTTGCATGTG 


17940 


GAAAAATCAA 


AATAAAAAGT 


CTGGACTCTC 


ACGCTCGCTT 


GGTCCTGTAA 


CTATTTTGTA 


18000 


GAATGGAAGA 


CATCAACTTT 


GCGTCTCTGG 


CCCCGCGACA 


CGGCTCGCGC 


CCGTTCATGG 


18060 


GAAACTGGCA 


AGATATCGGC 


ACCAGCAATA 


TGAGCGGTGG 


CGCCTTCAGC 


TGGGGCTCGC 


18120 


TGTGGAGCGG 


CATTAAAAAT 


TTCGGTTCCA 


CCGTTAAGAA 


CTATGGCAGC 


AAGGCCTGGA 


18180 


ACAGCAGCAC 


AGGCCAGATG 


CTGAGGGATA 


AGTTGAAAGA 


GCAAAATTTC 


CAACAAAAGG 


18240 


TGGTAGATGG 


CCTGGCCTCT 


GGCATTAGCG 


GGGTGGTGGA 


CCTGGCCAAC 


CAGGCAGTGC 


18300 


AAAATAAGAT 


TAACAGTAAG 


CTTGATCCCC 


GCCCTCCCGT 


AGAGGAGCCT 


CCACCGGCCG 


18360 


TGGAGACAGT 


GTCTCCAGAG 


GGGCGTGGCG 


AAAAGCGTCC 


GCGCCCCGAC 


AGGGAAGAAA 


18420 


CTCTGGTGAC 


GCAAATAGAC 


GAGCCTCCCT 


CGTACGAGGA 


GGCACTAAAG 


CAAGGCCTGC 


18480 


CCACCACCCG 


TCCCATCGCG 


CCCATGGCTA 


CCGGAGTGCT 


GGGCCAGCAC 


ACACCCGTAA 


18540 


CGCTGGACCT 


GCCTCCCCCC 


GCCGACACCC 


AGCAGAAACC 


TGTGCTGCCA 


GGCCCGACCG 


18600 


CCGTTGTTGT 


AACCCGTCCT 


AGCCGCGCGT 


CCCTGCGCCG 


CGCCGCCAGC 


GGTCCGCGAT 


18660 


CGTTGCGGCC 


CGTAGCCAGT 


GGCAACTGGC 


AAAGCACACT 


GAACAGCATC 


GTGGGTCTGG 


18720 


GGGTGCAATC 


CCTGAAGCGC 


CGACGATGCT 


TCTGAATAGC 


TAACGTGTCG 


TATGTGTGTC 


18780 


ATGTATGCGT 


CCATGTCGCC 


GCCAGAGGAG 


CTGCTGAGCC 


GCCGCGCGCC 


CGCTTTCCAA 


18840 


GATGGCTACC 


CCTTCGATGA 


TGCCGCAGTG 


GTCTTACATG 


CACATCTCGG 


GCCAGGACGC 


18900 


CTCGGAGTAC 


CTGAGCCCCG 


GGCTGGTGCA 


GTTTGCCCGC 


GCCACCGAGA 


CGTACTTCAG 


18960 


CCTGAATAAC 


AAGTTTAGAA 


ACCCCACGGT 


GGCGCCTACG 


CACGACGTGA 


CCACAGACCG 


19020 


GTCCCAGCGT 


TTGACGCTGC 


GGTTCATCCC 


TGTGGACCGT 


GAGGATACTG 


CGTACTCGTA 


19080 


CAAGGCGCGG 


TTCACCCTAG 


CTGTGGGTGA 


TAACCGTGTG 


CTGGACATGG 


CTTCCACGTA 


19140 


CTTTGACATC 


CGCGGCGTGC 


TGGACAGGGG 


CCCTACTTTT 


AAGCCCTACT 


CTGGCACTGC 


19200 






ACjGG tgcccc 


AAATCCTTG C 


GAATGGGATG 


AAGCTGCTAC 


19260 


TGCTCTTGAA 


ATAAACCTAG 


AAGAAGAGGA 


CGATGACAAC 


GAAGACGAAG 


TAGACGAGCA 


19320 


AGCTGAGCAG 


CAAAAAACTC 


ACGTATTTGG 


GCAGGCGCCT 


TATTCTGGTA 


TAAATATTAC 


19380 
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AAAGGAGGGT ATTCAAATAG GTGTCGAAGG TCAAACACCT AAATATGCCG ATAAAACATT 1-9440 

TCAACC7GAA CCTCAAATAG GAGAATCTCA GTGGTACGAA ACTGAAATTA ATCATGCAGC 195 00 

TGGGAGAGTC CTTAAAAAGA CTACCCCAAT GAAACCATGT TACGGTTCAT ATGCAAAAC C 195S 0 

CACAAATGAA AATGGAGGGC AAGGCATTCT TGTAAAGCAA CAAAATGGAA AGCTAGAAAG 19 620 
TCAAGTGGAA ATGCAATTTT TCTCAACTAC TGAGGCGACC GCAGGCAATG GTGATAACTT 
GACTCCTAAA GTGGTATTGT ACAGTGAAGA TGTAGATATA GAAACCCCAG ACACTCATAT 
TTCTTACATG CCCACTATTA AGGAAGGTAA CTCACGAGAA CTAATGGGCC AACAATCTAT 
GCCCAACAGG CCTAATTACA TTGCTTTTAG GGACAATTTT ATTGGTCTAA TGTATTACAA 
CAGCACGGGT AATATGGGTG TTCTGGCGGG CCAAGCATCG CAGTTGAATG CTGTTGTAGA 

TTTGCAAGAC AGAAACACAG AGCTTTCATA CCAGCTTTTG CTTGATTCCA TTGGTGATAG i 998 o 

AACCAGGTAC TTTTCTATGT GGAATCAGGC TGTTGACAGC TATGATCCAG ATGTTAGAAT 20040 

TATTGAAAAT CATGGAACTG AAGATGAACT TCCAAATTAC TGCTTTCCAC TGGGAGGTGT 20100 

GATTAATACA GAGACTCTTA CCAAGGTAAA ACCTAAAACA GGTCAGGAAA ATGGATGGGA 20160 

AAAAGATGCT ACAGAATTTT CAGATAAAAA TGAAATAAGA GTTGGAAATA ATTTTGCCAT 20220 

GGAAATCAAT CTAAATGCCA ACCTGTGGAG AAATTTCCTG TACTCCAACA TAGCGCTGTA 20280 

TTTGCCCGAC AAGCTAAAGT ACAGTCCTTC CAACGTAAAA ATTTCTGATA ACCCAAACAC 203 40 

CTACGACTAC ATGAACAAGC GAGTGGTGGC TCCCGGGTTA GTGGACTGCT ACATTAACCT 204 00 

TGGAGCACGC TGGTCCCTTG ACTATATGGA CAACGTCAAC CCATTTAACC ACCACCGCAA 20460 

TGCTGGCCTG CGCTACCGCT CAATGTTGCT GGGCAATGGT CGCTATGTGC CCTTCCACAT 20520 

CCAGGTGCCT CAGAAGTTCT TTGCCATTAA AAACCTCCTT CTCCTGCCGG GCTCATACAC 20580 

CTACGAGTGG AACTTCAGGA AGGATGTTAA CATGGTTCTG CAGAGCTCCC TAGGAAATGA 20640 

CCTAAGGGTT GACGGAGCCA GCATTAAGTT TGATAGCATT TGCCTTTACG CCACCTTCTT 20700 

CCCCATGGCC CACAACACCG CCTCCACGCT TGAGGCCATG CTTAGAAACG ACACCAACGA 20760 

CCAGTCCTTT AACGACTATC TCTCCGCCGC CAACATGCTC TACCCTATAC CCGCCAACGC 20820 

TACCAACGTG CCCATATCCA TCCCCTCCCG CAACTGGGCG GCTTTCCGCG GCTGGGCCTT 20880 

CACGCGCCTT AAGACTAAGG AAACCCCATC ACTGGGCTCG GGCTACGACC CTTATTACAC 20940 

CTACTCTGGC TCTATACCCT ACCTAGATGG AACCTTTTAC CTCAACCACA CCTTTAAGAA 21000 

GGTGGCCATT ACCTTTGACT CTTCTGTCAG CTGGCCTGGC AATGACCGCC TGCTTACCCC 21060 

CAACGAGTTT GAAATTAAGC GCTCAGTTGA CGGGGAGGGT TACAACGTTG CCCAGTGTAA 21120 

CATGACCAAA GACTGGTTCC TGGTACAAAT GCTAGCTAAC TACAACATTG GCTACCAGGG 21180 

CTTCTATATC CCAGAGAGCT ACAAGGACCG CATGTACTCC TTCTTTAGAA ACTTCCAGCC 2124 0 

CATGAGCCGT CAGGTGGTGG ATGATACTAA ATACAAGGAC TACCAACAGG TGGGCATCCT 21300 

ACACCAACAC AACAACTCTG GATTTGTTGG CTACCTTGCC CCCACCATGC G CGAAGG ACA 21360 

GGCCTACCCT GCTAACTTCC CCTATCCGCT TATAGGCAAG ACCGCAGTTG ACAGCATTAC 214 20 

CCAGAAAAAG TTTCTTTGCG ATCGCACCCT TTGGCGCATC CCATTCTCCA GTAACTTTAT 21480 
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GTCCATGGGC 


GCACTCACAG 


ACCTGGGCCA 


AAACCTTCTC 


TACGCCAACT 




2154 0 


GCTAGACATG 


ACTTTTGAGG 


TGGATCCCAT 


GGACGAGCCC 


ACCCTTCTTT 




^ lb 00 


TGAAGTCTTT 


GACGTGGTCC 


GTGTGCACCG 


GCCGCACCGC 


GGCGTCATCG 




TICCA 
£. 1 D O U 


CCTGCGCACG 


CCCTTCTCGG 


CCGGCAACGC 


CACAACATAA 


AGAAGCAAGC 


AACATCAACA 


-)i ion 


ACAGCTGCCG 


CCATGGGCTC 


CAGTGAGCAG 


GAACTGAAAG 


CCATTGTCAA 


AG ATGTTnn T 


-1 'OU 


TGTGGGCCAT 


ATTTTTTGGG 


CACCTATGAC 


AAGCGCTTTC 


CAGGCTTTGT 






AAGCTCGCCT 


GCGCCATAGT 
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ATACCACGCG CCACTGGGTC GTCTTCATTC AGCCGCCGCA CTGTGCGCTT ACCTCCTTTG 23640 
CCATGCTTGA TTAGCACCGG TGGGTTGCTG AAACCCACCA TTTGTAGCGC CACATCTTCT 23700 
CTTTCTTCCT CGCTGTCCAC GATTACCTCT GGTGATGGCG GGCGCTCGGG CTTGGGAGAA 23760 
GGGCGCTTCT TTTTCTTCTT GGGCGCAATG GCCAAATCCG CCGCCGAGGT CGATGGCCGC 23820 
GGGCTGGGTG TGCGCGGCAC CAGCGCGTCT TGTGATGAGT CTTCCTCGTC CTCGGACTCG 23880 
ATACGCCGCC TCATCCGCTT TTTTGGGGGC GCCCGGGGAG GCGGCGGCGA CGGGGACGGG 23940 
GACGACACGT CCTCCATGGT TGGGGGACGT CGCGCCGCAC CGCGTCCGCG CTCGGGGGTG 24000 
GTTTCGCGCT GCTCCTCTTC CCGACTGGCC ATTTCCTTCT CCTATAGGCA GAAAAAGATC 
ATGGAGTCAG TCGAGAAGAA GGACAGCCTA ACCGCCCCCT CTGAGTTCGC CACCACCGCC 
TCCACCGATG CCGCCAACGC GCCTACCACC TTCCCCGTCG AGGCACCCCC GCTTGAGGAG 
GAGGAAGTGA TTATCGAGCA GGACCCAGGT TTTGTAAGCG AAGACGACGA GGACCGCTCA 
GTACCAACAG AGGATAAAAA GCAAGACCAG GACAACGCAG AGGCAAACGA GGAACAAGTC 
GGGCGGGGGG ACGAAAGGCA TGGCGACTAC CTAGATGTGG GAGACGACGT GCTGTTGAAG 
CATCTGCAGC GCCAGTGCGC CATTATCTGC GACGCGTTGC AAGAGCGCAG CGATGTGCCC 
CTCGCCATAG CGGATGTCAG CCTTGCCTAC GAACGCCACC TATTCTCACC GCGCGTACCC 
CCCAAACGCC AAGAAAACGG CACATGCGAG CCCAACCCGC GCCTCAACTT CTACCCCGTA 
TTTGCCGTGC CAGAGGTGCT TGCCACCTAT CACATCTTTT TCCAAAACTG CAAGATACCC 
CTATCCTGCC GTGCCAACCG CAGCCGAGCG GACAAGCAGC TGGCCTTGCG GCAGGGCGCT 24660 
GTCATACCTG ATATCGCCTC GCTCAACGAA GTGCCAAAAA TCTTTGAGGG TCTTGGACGC 24 72 0 
GACGAGAAGC GCGCGGCAAA CGCTCTGCAA CAGGAAAACA GCGAAAATGA AAGTCACTCT 24730 
GGAGTGTTGG TGGAACTCGA GGGTGACAAC GCGCGCCTAG CCGTACTAAA ACGCAGCATC 24840 
GAGGTCACCC ACTTTGCCTA CCCGGCACTT AACCTACCCC CCAAGGTCAT GAGCACAGTC 24900 
ATGAGTGAGC TGATCGTGCG CCGTGCGCAG CCCCTGGAGA GGGATGCAAA TTTGCAAGAA 24 96 0 
CAAACAGAGG AGGGCCTACC CGCAGTTGGC GACGAGCAGC TAGCGCGCTG GCTTCAAACG 25020 
CGCGAGCCTG CCGACTTGGA GGAGCGACGC AAACTAATGA TGGCCGCAGT GCTCGTTACC 25080 
GTGGAGCTTG AGTGCATGCA GCGGTTCTTT GCTGACCCGG AGATGCAGCG CAAGCTAGAG 
GAAACATTGC ACTACACCTT TCGACAGGGC TACGTACGCC AGGCCTGCAA GATCTCCAAC 
GTGGAGCTCT GCAACCTGGT CTCCTACCTT GGAATTTTGC ACGAAAACCG CCTTGGGCAA 
AACGTGCTTC ATTCCACGCT CAAGGGCGAG GCGCGCCGCG ACTACGTCCG CGACTGCGTT 
TACTTATTTC TATGCTACAC CTGGCAGACG GCCATGGGCG TTTGGCAGCA GTGCTTGGAG 
GAGTGCAACC TCAAGGAGCT GCAGAAACTG CTAAAGCAAA ACTTGAAGGA CCTATGGACG 
GCCTTCAACG AGCGCTCCGT GGCCGCGCAC CTGGCGGACA TCATTTTCCC CGAACGCCTG 
CTTAAAACCC TGCAACAGGG TCTGCCAGAC TTCACCAGTC AAAGCATGTT GCAGAACTTT 2 5560 
AGGAACTTTA TCCTAGAGCG CTCAGGAATC TTGCCCGCCA CCTGCTGTGC ACTTCCTAGC 25620 
GACTTTGTGC CCATTAAGTA CCGCGAATGC CCTCCGCCGC TTTGGGGCCA CTCCTACCTT 2 5680 
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GACCTCCCGG CCAC7ATCCG GATCAATTTA TTCCTAACTT TGACGCGGTA AAGGACTCGG :7840 

CGGACGGC7A CGACTGAATG TTAAGTGGAG AGGCAGAGCA ACTGCGCCTG AAACACCTGG Z7900 

7CCAC7G7CG CCGCCACAAG TGCTTTGCCC GCGACTCCGG TGAGTTTTGC TACTTTGAAT "960 

7GCCCGAGGA TCATATCGAG GGCCCGGCGC ACGGCGTCCG GCTTACCGCC CAGGGAGAGC :8020 
TTGCCCG7AG CCTGATTCGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CGGGACAGGG 
GACCCTGTGT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT CAAGATCTT7 
GTTGCC.-.TCT CTGTGCTGAG TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
CGCCATCCTG TAAACGCCAC CGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCT7ACC7 
GG7AC7777A ACATCTCTCC CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAG7 
CTACGAGAGA ACCTCTCCGA GCTCAGCTAC TCCATCAGAA AAAACACCAG CCTCCTTACC 
TGCCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC TGACCGTAAA 
. CCAGAC7777 TCCGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG GTGAGCTTAG 

AAAACCC77A GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA ACAATTCAAG 28560 

CAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG TTATTCTCTG 28620 

TCTTGTGATT CTCTTTATTC TTATACTAAC GCTTCTCTGC CTAAGGCTCG CCGCCTGCTG 28680 

TGTGCACATT TGCATTTATT GTCAGCTTTT TAAACGCTGG GGTCGCCACC CAAGATGATT ' 28740 

AGGTACA7AA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGTACCAC CCAAAAGGTG 28800 

GATTTTAAGG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAGCTAATGA G7GCACCACT 28860 

CT7ATAAAA7 GCACCACAGA ACATGAAAAG C7GC77A7TC GCCACAAAAA CAAAA77GGC 28920 

AAG7A7GC7G T77ATGCTA7 77GGCAGCCA GG7GACACTA CAGAG7A7AA TG77ACAG77 28980 

77CCAGGG7A AAAG7CATAA AACTTT7ATG 7A7ACTTT7C CA7T7TA7GA AA7G7GCGAC 29040 

AT7ACCA7G7 ACATGAGCAA ACAGTA7AAG T7GTGGCCCC CACAAAA77G TG7GGAAAAC 29100 

ACTGGCAC7T 7CTGC7GCAC 7GCTATGCTA ATTACAGTGC TCGC7TTGG7 C7G7ACCC7A 29160 

CTCTA7A77A AATACAAAAG CAGACGCAGC TTTATTGAGG AAAAGAAAA7 GCC7TAAT77 29220 

ACTAAG77AC AAAGCTAATG TCACCACTAA CTGCTTTACT CGCTGCTTGC AAAACAAATT 29280 

■CAAAAAG7TA GCATTATAAT TAGAATAGGA 7TTAAACCCC CCGGTCATT7 CCTGC7CAA7 29340 

ACCATTCCCC TGAACAATTG AC7CTATGTG GGATATGCTC CAGCGC7ACA ACC7TGAAGT 29400 

CAGGCT7CCT GGATGTCAGC ATCTGACTTT GGCCAGCACC 7GTCCCGCGG A7TTG7TCCA 29460 

^GTCCAAC7AC AGCGACCCAC CCTAACAGAG A7GACCAACA CAACCAACGC GGCCGCCGCT 2 9520 

ACCGGAC77A CATCTACCAC AAATACACCC CAAG7TTCTG CCTTTGTCAA 7AACTGGGAT 29580 

AAC77GGGCA TG7GGTGGTT CTCCA7AGCG CT7ATG7TTG TA7GCCTTA7 TATTA7GTGG 2 9640 

CTCATC7GCT GCCTAAAGCG CAAACGCGCC CGACCACCCA TCTATAGTCC CATCATTG7G 2 9700 

CTACACCCAA ACAATGA7GG AA7CCA7AGA TTGGACGGAC 7GAAACACA7 G7TCT77TCT 29760 

CTTACAG7AT GATTAAATGA GACATGATTC CTCGAGTTTT TATA7TACTG ACCCTTGTTG 29820 

CGC77T77TG 7GCG7GC7CC ACATTGGC7G CGG7TTCTCA CA7CGAAGTA GAC7GCAT7C 2 9880 



- 83 - 



WO 98/17783 PCT/US97/19541 

CAGCCTTCAC AGTCTATTTG CTTTACGGAT TTGTCACCCT CACGCTCATC TGCAGCCTCA 2 9940 

TCACTGTGGT CATCGCCTTT ATCCAGTGCA TTGACTGGGT CTGTGTGCGC 7TTGCATATC 30 0 00 

TCAGACACCA TCCCCAGTAC AGGGACAGGA CTATAGCTGA GCTTCTTAGA ATTCTTTAAT 30060 

TATGAAATTT ACTGTGACT7 TTCTGCTGAT TATTTGCACC CTATCTGCGT TTTGTTCCCC 30120 

GACCTCCAAG CCTCAAAGAC ATATATCATG CAGATTCACT CGTATATGGA ATATTCCAAG 3018 0 

TTGCTACAAT GAAAAAAGCG ATCTTTCCGA AGCCTGGTTA TATGCAATCA TCTCTGTTAT 30240 

GGTGTTCTGC AGTACCATCT TAGCCCTAGC TATATATCCC TACCTTGACA TTGGCTGGAA 30300 

ACGAATAGAT GCCATGAACC ACCCAACTTT CCCCGCGCCC GCTATGCTTC CACTGCAACA 3036 0 

AGTTGTTGCC GGCGGCTTTG TCCCAGCCAA TCAGCCTCGC CCCACTTCTC CCACCCCCAC 30420 

TGAAATCAGC TACTTTAATC TAACAGGAGG AGATGACTGA CACCCTAGAT CTAGAAATGG 304 8 0 

ACGGAATTAT TACAGAGCAG CGCCTGCTAG AAAGACGCAG GGCAGCGGCC GAGCAACAGC 30540 

GCATGAATCA AGAGCTCCAA GACATGGTTA ACTTGCACCA GTGCAAAAGG GGTATCTTTT 30600 

GTCTGGTAAA GCAGGCCAAA GTCACCTACG ACAGTAATAC CACCGGACAC CGCCTTAGCT 30660 

ACAAGTTGCC AACCAAGCGT CAGAAATTGG TGGTCATGGT GGGAGAAAAG CCCATTACCA 30720 

TAACTCAGCA CTCGGTAGAA ACCGAAGGCT GCATTCACTC ACCTTGTCAA GGACCTGAGG 30780 

ATCTCTGCAC CCTTATTAAG ACCCTGTGCG GTCTCAAAGA TCTTATTCCC TTTAACTAAT 30840 

AAAAAAAAAT AAT AAAG CAT CACTTACTTA AAATCAGTTA GCAAATTTCT GTCCAGTTTA 30 900 

TTCAGCAGCA CCTCCTTGCC CTCCTCCCAG CTCTGGTATT GCAGCTTCCT CCTGGCTGCA 3 0 960 

AACTTTCTCC ACAATCTAAA TGGAATGTCA GTTTCCTCCT GTTCCTGTCC ATCCGCACCC 31020 

ACTATCTTCA TGTTGTTGCA GATGAAGCGC GCAAGACCGT CTGAAGATAC CTTCAACCCC 31080 

GTGTATCCAT ATGACACGGA AACCGGTCCT CCAACTGTGC CTTTTCTTAC TCCTCCCTTT 3114 0 

GTATCCCCCA ATGGGTTTCA AGAGAGTCCC CCTGGGGTAC TCTCTTTGCG CCTATCCGAA 31200 
CCTCTAGTTA CCTCCAATGG CATGCTTGCG CTCAAAATGG GCAACGGCCT CTCTCTGGAC . 3126 0 

GAGGCCGGCA ACCTTACCTC CCAAAATGTA ACCACTGTGA GCCCACCTCT CAAAAAAACC 31320 

AAGTCAAACA TAAACCTGGA AATATCTGCA CCCCTCACAG TTACCTCAGA AGCCCTAACT 31380 

GTGGCTGCCG CCGCACCTCT AATGGTCGCG GGCAACACAC TCACCATGCA ATCACAGGCC 31440 

CCGCTAACCG TGCACGACTC CAAACTTAGC ATTGCCACCC AAGGACCCCT CACAGTGTCA 31500 

GAAGGAAAGC TAGCCCTGCA AACATCAGGC CCCCTCACCA CCACCGATAG CAGTACCCTT 31560 

ACTATCACTG CCTCACCCCC TCTAACTACT GCCACTGGTA GCTTGGGCAT TGACTTGAAA 31620 

GAGCCCATTT ATACACAAAA TGGAAAACTA GGACTAAAGT ACGGGGCTCC TTTGCATGTA 31680 

ACAGACGACC TAAACACTTT GACCGTAGCA ACTGGTCCAG GTGTGACTAT TAATAATACT 31740 

TCCTTGCAAA CTAAAGTTAC TGGAGCCTTG GGTTTTGATT CACAAGGCAA TATGCAACTT 31800 

AATGTAGCAG GAGGACTAAG GATTGATTCT CAAAACAGAC GCCTTATACT TGATGTTAGT 31860 

TATCCGTTTG ATGCTCAAAA CCAACTAAAT CTAAGACTAG GACAGGGCCC TCTTTTTATA 31920 

AACTCAGCCC ACAACTTGGA TATTAACTAC AACAAAGGCC TTTACTTGTT TACAGCTTCA 31980 
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AACAA.TCCA AAAAG CTTG A GGTTAACCTA AGCACTGCCA AGGGGTTGAT GTTTGACGCT 3 204 0 
ACAGCCATAG CCATTAATGC AGGAGATGGG CTTGAATTTG GTTCACCTAA TGCACCAAAC 3 2100 
ACAAATCCCC TCAAAACAAA AATTGGCCA7 GGCCTAGAAT TTGATTCAAA CAAGGCTATG 
GTTCCTAAAC TAGGAACTGG CCTTAGTTTT GACAGCACAG GTGCCATTAC AGTAGGAAAC 
AAAAATAATG ATAAGCTAAC TTTGTGGACC ACACCAGCTC CATCTCCTAA CTGTAGACTA 
AATGCAGAGA AAGATGCTAA ACTCACTTTG GTCTTAACAA AATGTGGCAG TCAAATACTT 
GCTACAGTTT CAGTTTTGGC TGTTAAAGGC AGTTTGGCTC CAATATCTGG AACAGTTCAA 
AGTGCTCATC TTATTATAAG ATTTGACGAA AATGGAGTGC TACTAAACAA TTCCTTCCTG 
GACCCAGAAT ATTGGAACTT TAGAAATGGA GATCTTACTG AAGGCACAGC CTATACAAAC 32520 
GCTGTTGGAT TTATGCCTAA CCTATCAGCT TATCCAAAAT CTCACGGTAA AACTGCCAAA 3 2580 
AGTAACATTG TCAGTCAAGT TTACTTAAAC GGAGACAAAA CTAAACCTGT AACACTAACC 32640 
ATTACACTAA ACGGTACACA GGAAACAGGA GACACAACTC CAAGTGCATA CTCTATGTCA 32700 
TTTTCATGGG ACTGGTCTGG CCACAACTAC ATTAATGAAA TATTTGCCAC ATCCTCTTAC 
ACTTTTTCAT ACATTGCCCA AGAATAAAGA ATCGTTTGTG TTATGTTTCA ACGTGTTTAT 
TTTTCAATTG CAGAAAATTT CAAGTCATTT TTCATTCAGT AGTATAGCCC CACCACCACA 
TAGCTTATAC AGATCACCGT ACCTTAATCA AACTCACAGA ACCCTAGTAT TCAACCTGCC 
ACCTCCCTCC CAACACACAG AGTACACAGT CCTTTCTCCC CGGCTGGCCT T AAAAAG CAT 
CATATCATGG GTAACAGACA TATTCTTAGG TGTTATATTC CACACGGTTT CCTGTCGAGC 33060 
CAAACGCTCA TCAGTGATAT TAATAAACTC CCCGGGCAGC TCACTTAAGT TCATGTCGCT 3 312 0 
GTCCAGCTGC TGAGCCACAG GCTGCTGTCC AACTTGCGGT TGCTTAACGG GCGGCGAAGG 
AGAAGTCCAC GCCTACATGG GGGTAGAGTC ATAATCGTGC ATCAGGATAG GGCGGTGGTG 
CTGCAGCAGC GCGCGAATAA ACTGCTGCCG CCGCCGCTCC GTCCTGCAGG AATACAACAT 
GGCAGTGGTC TCCTCAGCGA TGATTCGCAC CGCCCGCAGC ATAAGGCGCC TTGTCCTCCG 
GGCACAGCAG CGCACCCTGA TCTCACTTAA ATCAGCACAG TAACTGCAGC ACAGCACCAC 
AATATTGTTC AAAATCCCAC AGTGCAAGGC GCTGTATCCA AAGCTCATGG CGGGGACCAC 
AGAACCCACG TGGCCATCAT ACCACAAGCG CAGGTAGATT AAGTGGCGAC CCCTCATAAA 33 540 
CACGCTGGAC ATAAACATTA CCTCTTTTGG CATGTTGTAA TTCACCACCT CCCGGTACCA 33600 
TATAAACCTC TGATTAAACA TGGCGCCATC CACCACCATC CTAAACCAGC TGGCCAAAAC 
CTGCCCGCCG GCTATACACT GCAGGGAACC GGGACTGGAA CAATGACAGT GGAGAGCCCA 
GGACTCGTAA CCATGGATCA TCATGCTCGT CATGATATCA ATGTTGGCAC AACACAGGCA 
CACGTGCATA CACTTCCTCA GGATTACAAG CTCCTCCCGC GTTAGAACCA TATCCCAGGG 
AACAACCCAT TCCTGAATCA GCGTAAATCC CACACTGCAG GGAAGACCTC GCACGTAACT 
CACGTTGTGC ATTGTCAAAG TGTTACATTC GGGCAGCAGC GGATGATCCT CCAGTATGGT 3 3 960 
AGCGCGGGTT TCTGTCTCAA AAGGAGGTAG ACGATCCCTA CTGTACGGAG TGCGCCGAGA 34 020 
CAACCGAGAT CGTGTTGGTC GTAGTGTCAT GCCAAATGGA ACGCCGGACG TAGTCATATT 34080 



32760 
32820 
32880 
32940 
33000 



33180 
33240 
33300 
33360 
33420 
33480 



33660 
33720 
33780 
33840 
33900 
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TCC7GAAGCA 


AAACCAGGTG 


CGGGCGTGAC 


AAACAGATCT 

******* v_ rtvjri 




i I t-GCCuv. . 


34140 


TAGATCGCTC 


TGTGTAG7AG 


TTG TAG TATA 


TCCACTCTCT 


CAAAnpATPP 


n VJO V_ O*- t_ C v_ 


34200 




TTCTATGTAA 


ACTCCTTCAT 


GCGCCGCTGC 


PPTPATAAPA 




34260 


CAGAATAAGC 


CACACCCAGC 


CAACGTACAC 


ATTCGTTCTG 






34320 


CGGGAAGAGC 


TGGAAGAACC 


ATGTTTTTTT 


TTTTATTCCA 


AAAGATTATP 




34380 


AAATGAAGAT 


CTATTAAGTG 


AACGCGCTCC 


CCTCCGGTGG 


CGTGGTPA A A 




3444 0 


AAAGAACAGA 


TAATGGCATT 


TGTAAGATGT 


TGCACAATGG 


CTTPPAAAAP 




34500 


CTCACGTCCA 


AGTGGACGTA 


AAGGCTAAAC 


CCTTCAGGGT 


GAATCTCCTC 




34 560 


CCAGCACCTT 


CAACCATGCC 


CAAATAATTC 


TCATCTCGCC 


ACCTTCTCAA 


TiTSTPTPTJ. 


34620 


AGCAAATCCC 


GAATATTAAG 


TCGGGCCATT 


GTAAAAATCT 


GCTCCAGAGC 




34680 


TTCAGCGTCA 


AGCAGCGAAT 


CATGATTGCA 


AAAATTCAGG 


TTCCTCACAG 


______ 

^LL luirtl An 


34 74 0 


GATTCAAAAG 


CGGAACATTA 


ACAAAAATAC 


CGCGATCCCG 


TAGGTCCCTT 


CGCACnPPP^ 


1 A Q rt A 


GCTGAACATA 


ATCGTGCAGG 


TCTGCACGGA 


CCAGCGCGGC 


CACTTCCPPG 


PPAPn A arr'" 


34860 


TGACAAAAGA 


ACCCACACTG 


ATTATGACAC 


GCATACTCGG 


AGCTATGCTA 




j 4 y j u 


CCCCGATGTA 


AGCTTTGTTG 


GATGGGCGGC 


GATATAAAAT 


GCAAGGTGPT 


cir"Tc & & & ft a ■* 


34 980 


TCAGGCAAAG 


CGTCGCGCAA 


AAAAGAAAGC 


ACATCGTAGT 


CATGPTPATP 


nr.!\Tr\ n n 
L-rtVj/A 1 AAACjvj 


3504 0 


CAGGTAAGCT 


CCGGAACCAC 


CACAGAAAAA 


GACACCATTT 


TTrTrTC AHA 


Irt I Cj I L TGCG 


35100 


GGTTTCTGCA 


TAAACACAAA 


ATAAAATAAC 


AAAAAAACAT 


TTAAAPATTA 


Vj>\i\Vjv_L lb IL 


35160 


TTACAACAGG 


AAAAACAACC 


CTTATAAGCA 


TAAGACGGA P 




V-LLjL>Q_Li lGAC 


35220 


CGTAAAAAAA 


CTGGTCAGCG 


TGATTAAAAA 


GCACCACCGA 


papptpptpti 




35280 


GAGTCATAAT 


GTAAGACTCG 


GTAAACACAT 


CAGGTTGATT 


p at cnciTc a n 


TT^^Tft ft ft ft A r* 


3534 0 


CGACCGAAAT 


AGCCCGGGGG 


AATACATACC 


CGCAGGCGTA 


hagapa araT 


1 MLjL_ L_ LLl 


i c a n n 
J b4 00 


ATAGGAGGTA 


TAACAAAATT 


AATAGGAGAG 


AAAAACACAT 


AAAPAPPTPI A 




J b4 b U 


TGCCTAGGCA 


AAATAGCACC 


CTCCCGCTCC 


AGAACAACAT 


ACAGPGPTTP 


AP Anrprrsr 




CCTAACAGTC 


AGCCTTACCA 


GTAAAAAAGA 


AAACCTATTA 


AAAAAAPAPP 


A PTPn ara cc 




GCACCAGCTC 


AATCAGTCAC 


AGTGTAAAAA 


AGGGCCAAGT 


gpapapppap 






ACTAAAAAAT 


GACGTAACGG 


TTAAAGTCCA 


CAAAAAACAC 


CC AG AAA APP 


ppAprinr*!! ar 




CTACGCCCAG 


AAACGAAAGC 


CAAAAAACCC 


ACAACTTCCT 


CAAATPPTPA 

^fww 1 v- w i L_.rt 


V- 1 1 H—Lj ill. 


jj / Ou 


CCC ft CCTT ft f 


G T AACTTC C C 


ATTTTAAGAA 


AACTACAATT 


CCCAACACAT 


ACAAGTTACT 


35820 


CCGCCCTAAA 


ACCTACGTCA 


CCCGCCCCGT 


TCCCACGCCC 


CGCGCCACGT 


CACAAACTCC 


35880 


ACCCCCTCAT 


TATCATATTG 


GCTTCAATCC 


AAAATAAGGT 


ATATTATTGA 


TGATG 


35935 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

( D) TOPOLOGY: linear 
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MOLECULE TYPE: ocher nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

txi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

TTCATTTTAT GTTTCAGGTT CAGGG 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND EDMESS : single 

(D) TOPOLOGY: linear 

iii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

txi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TTACCGCCAC ACTCGCAGGG 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 34303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
. (D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: ocher nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

TTATTTTGGA TTGAAGCCAA TATGATAATG AGGGGGTGGA GTTTGTGACG 

CGTGGGAACG GGGCGGGTGA CGTAGTAGTG TGGCGGAAGT GTGATGTTGC 

GAACACATGT AAGCGACGGA TGTGGCAAAA GTGACGTTTT TGGTGTGCGC 

GGACGGGTGT GGTCGCCATG ATCGCGTAGT CGATAGTGGC TCCAAGTAGC 

GGACTGGGCG GCGGCCAAAG CGGTCGGACA GTGCTCCGAG AACGGGTGCG 

GCATCAACGC ATATAGCGCT AGCAGCACGC CATAGTGACT GGCGATGCTG 

CGATATCCCG CAAGAGGCCC GGCAGTACCG GCATAACCAA GCCTATGCCT 

GGGTGACGGT GCCGAGGATG ACGATGAGCG CATTGTTAGA TTTCATACAC 

TGCGTTAGCA ATTTAACTGT GATAAACTAC CGCATTAAAG CTTATCGATG 

AAACATGAGA ATTCTTGAAG ACGAAAGGGC CTCGTGATAC GCCTATTTTT 

GTCATGATAA TAATGGTTTC TTAGACGTCA GGTGGCACTT TTCGGGGAAA 

ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT 

CCCTGATAAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA TGAGTATTCA 

GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG TTTTTGCTCA 

CTGGTGAAAG TAAAAGATGC TGAAGATCAG TTGGGTGCAC GAGTGGGTTA 

GATCTCAACA GCGGTAAGAT CCTTGAGAGT TTTCGCCCCG AAGAACGTTT 

AGCACTTTTA AAGTTCTGCT ATGTGGCGCG GTATTATCCC GTGTTGACGC 

CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG TTGAGTACTC 
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TGGCGCGGGG 
AAGTGTGGCG 
CGGATCCACA 
GAAGCGAGCA 
CATAGAAATT 
TCGGAATGGA 
ACAGCATCCA 
GGTGCCTGAC 
ATAAGCTGTC 
ATAGGTTAAT 
TGTGCGCGGA 
GAGACAATAA 
ACATTTCCGT 
CCCAGAAACG 
CATCGAACTG 
TCCAATGATG 
CGGGCAAGAG 
ACCAGTCACA 



25 



20 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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PCT/US97/19541 


paaaapp* " r * / ~ 


"""** a PPP i TPP 


L.nibAuAGTA 


AGAGAATTA.T 


GCAGTGCTGC 


PAT A. A PP A 

*w A i AAGCA x \j 


1140 


rt\J 1 VJrt a rwL .AA 


crczmczrc a a 

L 1 VjLIjvjl LMn 


Ll i AL 1 ILlb 


ACAACGATCG 


GAGGACCGAA 


PP7APPTIV n p^ 
GvjAGG TAACC 


1200 


nrTTTT*".^r 

u\- 1 I 1 1 . . 'JL 


HLrtnLrt 1 LjLLj 


n *r*P" a t , pt' n 
UvjA I C ATGTA 


ACTCGCCTTG 


ATCGTTGGGA 


^Cf^r^f^ appt*p 
AlGGGAGCTG 


1260 


AATGAAGGCA 


TAPPA aarna 

I rtLLMJ-xrtLOM 


PP A P PP TP A P 


ACCACGATGC 


CTGCAGCAAT 


ggpa apa a pp 
vjulmhlAALu 


1320 


TTGCGCAAAC 


TATTAAPTGG 


PGA APT A PTT 
LoHAL I AL. 1 I 


A PTPTA PPTT 

AL rCTAGCTT 


CCCGGCAACA 


A TT n A T A P A P 
Mil AA 1 AG AL 


1380 


TGGATGGAGG 


PGP A T A A A PT 


Tpf ^ P"P > PP T\ 


CTTCTGCGCT 


CGGCCCTTCC 


1 vjIjL 1 LjVj 


144 0 


TTTATTGCTG 


AT A A ATPTPP 


Ti.rTTTT'Tf* A P 
A^jLLOLj i LjAo 


CGTGGGTCTC 


GCGGTATCAT 


TPP APP a ptp 
IVjLAvjLAL 1G 


1500 


GGGCCAGATG 


GTAAPPPPTP 
1 PW-VLL L L 1 L 


PPPT* & TPP T 7\ 


GTTATCTACA 


CGACGGGGAG 


TCAGGPAAPT 

a ^n\jvjv_rtr\L i 


iboO 


ATGGATGAAC 


GAAATAGAPA 


(~1 ATPPPTP AP 
on 1 L vj v_ 1 Vj A\j 


A 1 AGG TGCCT 


CACTGATTAA 


GCATTGGT A A 


icon 
lb ZU 


CTGTCAGACC 


AAGTTTAPTP 

txJ\\J 111 J-\L. 1 L 


ATATATQPTT 
A I A 1 A i AV_ 1 1 


TP A P 71 TTP A TT* 

I AGA 1 1 G ATT 


TAAAACTTCA 


TTTTT A A TTT 
l l 1 I 1 r\A ill 


16 8 0 




apptp a ap at 


PPTTTTTP T\ T» 


AATCTCATGA 


CCAAAATCCC 


TTA APPTP AP 


174 0 


TTTTPGT^PP 


APTGAPPPTP 

rtL 1 LJrtOL VJ 1 L 


AP RPPPfPT/l 


P A A ?l A P A T*/~» TV 

GAAAAGATCA 


AAGGATCTTC 


TTGAGATPPT 

* * \Jn\jt\ 1 LL 1 


i q n 


TTTTTTCTGC 


GPGTAATPTP 

vJLU 1 rtM IL ivj 


PTPPTTPPA a 
V- J. VjL, 1 I AA 


APAJ\A1\AAAP 

ALAAAAAAAC 


CACCGCTACC 


^VJLVJVj 1 L?V3 1 1 


1 Q C A 

1 o 6 0 


TGTTTGC^GG 


ATP A APAPPT 


APPA 5PTPTT 


I I TCCGAAGG 


TAACTGGCTT 


PAGP AP APPP 


i y z o 


CAGATAPPAA 


AT A PTPTPPT 
M 1 ML 1 Vj 1 V_ L 1 


TPTSPTPT7\P 
iLi PiSj I \j I AO 


PPP'P TV r"T"f * /-» 

CCGTAGTTAG 


GCCACCACTT 


PA AP A APTPT 


1 Q Q A 

i y 8 o 


GTAGPAPPGP 


PTAP ATAPPT 
L IMLrtiHLL 1 


PPPTPTPPT1\ 


ATCCTGTTAC 


CAGTGGCTGC 


TPPP APTPPP 
i oLLAu 1 LjvjL 


204 0 


\Jt\ 1 MjHVj I 1 


ulLl IALLLjLj 


PTTPP A PTP A 


AGACGATAGT 


TACCGGATAA 


LjuLoLAuLbu 


2100 


TPPPPPTP A A 


LkjVjVjL-VjLi 1 1L 


PTPP A P I\ P 7\ P 


CCCAGCTTGG 


AGCGAACGAC 


PT APA PPP A A 
v- 1ALALLGAA 




ptpap a^app 


i ALALjLG i uA 


LtL A I TG AG AA 


AGCGCCACGC 


TTCCCGAAGG 


PAPAA APP PP 
uH vj AAALjG L, G 


1 O -) A 


Gap a ppt "* tp 


pppta Afrrn 

LGG i AAGlGG 


CAGGGTCGGA 


AC AGG AG AG C 


GCACGAGGGA 


PPTTPPAPPP 
VjL 1 1 LLAuuo 


**» ~> □ A 


PP A A Qp^^PT 


VjG 1 A 1 G 1 1 1A 


T 7\ P r PPPT*/" , »»T'^ 

1 AGTCCTGTC 


GGGTTTCGCC 


ACCTCTGACT 


TP APPPTPP A 
a UnuLu 1 LLjA 


Z J H U 


TTTTTPTP A T 


nrTrrTra PP 

LJ<*- 1 LLj 1 L.A\jLj 


GGGGLGGAGC 


CTATGGAAAA 


ACGCCAGCAA 


PGPGPPPTTT 

LULUOL* ill 


it n c\ 

Z n U \J 


TT A PPPP^PP 


TPPPPTTTTP 


ptppp pn " i'» 1''^** 

l IGGUCTTTT 


GCTCACATGT 


TCTTTCCTGC 


PTTATPPPPT 
vj i lAlLLLLl 


z ^ o u 


GATTPTPTPP 


A 1 HAL LG 1 A i 


T* ^ PPPPr^T^T»rn 


GAGTGAGCTG 


ATACCGCTCG 


LLULMuLLUA 


Z 3 Z U 


APGIaPPP " PP 


bLAbLuAb I G 


AGTGAGCGAG 


GAAGCGGAAG 


AGCGCCTGAT 


PPPPT* ATTTT 
uLyvj 1 A 1 1 1 1 


Z 3 O U 


ptpptt ^ ^rr 

V- 1 LL 1 in^uU 


AlLibl Gl GG 


1 ATTILACAC 


CGCATATGGT 


GCACTCTCAG 


TAP A ATPTPP 
1 ML nrt 1 L 1 VJ L 


Z O 4 VJ 


TPTP A TPPPP 


PATAPTTA Ap 
Ln 1 AG 1 1 AAG 


PP ^ PT* TV ^T^ IV ^ 

G L AGTATAC A 


CTCCGCTATC 


GCTACGTGAC 


TGPPTP ATPP 

1 OUU 1 LA 1 OvJ 


Z / \J U 


PTGPGP^^PG 


APAPPPPPP A 
ALrtLLLuLLA 


n p t\ r~T^ pp ptp 

ALAGCGGCTG 


ACGCGCCCTG 


ACGGGCTTGT 


L 1 VJL 1 LLLL7VJ 


97CA 
Z f D U 


Lrt. 1 LLLjv. - 1 A 


P A P A P 7\ A P PT^ 

lAGAl AAGG I 


GTGACCGTCT 


CCGGGAGCTG 


CATGTGTCAG 


AGGTTTTPAP 
rvo\j ill i. L ML 


28 20 


V, O 1 Lrt 1 Li-ILL 


PA A APPPPPP 
uAAALuL-uLu 


& PPP 7\ PTPT T\ 

AGGGAG rCTA 


/"i ft * » ft ft 

GACAATAGTA 


GTACGGATAG 


CTGTGAPTPP 

V- l \J 1 UnL 1 LL 


2380 


PPTPPT^^TA 


AV-AlALG III 


i G AGAT AC AC 


CCGGTGGTCC 


CGCTGTGCCC 


P ATT A A APP A 
LAI IAAMLlA 




P TTP P ^ t *""P a 


uAb I I GG i ovj 


GCGTCGCCAG 


GCTGTGGAAT 


GTATCGAGGA 


PTTPPTTA AP 
\_ a 1 OL 1 1 MnL 




PAPPPTP^PP 
OAVjLL 1 i;uoL 


& n ppTTTrr a 
AALL 1 I iVjLjA 


G TTG AGCTGT 


AAACGCCCCA 


GGCCATAAGG 


TGTAAACCTG 


3060 


TGATTGCGTG 


TGTGGTTAAC 


GCCTTTGTTT 


GCTGAATGAG 


TTGATGTAAG 


TTTAATAAAG 


3120 


GGTGAGATAA 


TGTTTAACTT 


GCATGGCGTG 


TTAAATGGGG 


CGGGGCTTAA 


AGGGTATATA 


3180 
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iT0C=:C=T ° «™«tct gacgtcatgg AG 'CTOs,,,,,*, 

GATTTTTCTG CTGTGCGTAA CTTGC-GOAA ^ AGCCTT ^A GTGTTTGGAA .„„ 

«™» GGGGCTCATC CG^ ^ ™™ - 

—™ AAGAGCTTTT GAAATCCTGT GGTGAGCTGT „ G^^ 
CACCAGGCGC TTTTCCAAGA GAAGGTCAT, ^TGATTCTTT GAATCTGGGT 

GCTGCGGCTG CTGTTGCTT ^ ^ """"^ 

CTGAGCGGGG GGTACCTGCT ^ """"" """""" 

CACAAGAATC GCCTGCTACT G^ 

CAGCAGCAGC AGCAGGAGGA Z «~ 

AGAGCCGGCC TGGACC^TCG G^ 

GGAATGAATG TTGTACAGGT GGCTGAACTG T.rr 

TGAGACGCAT TTTCACAATT ACAGAGGATG GGCAGGGGCT ^ 

GGGGGGCTTG TGAGGCTACA GAGGAGGCTA GGAATCTAGC ^™ 

ACCGTCCTGA GTGTATTACT TTTC AACAGA TCAAGGATAA J£Z 

TGCTGGCGCA GAAGTATTCC ATAGAGCAGC TGACCACTTA G^ 

~A GGCTATTAGG GTATATGCAA AGGTGGCACT ^ 
AGATCAGCAA ACTTGTAAAT ATCAGGAATT GTTCGTAGAT ^ 
TGGAGATAGA TACGGAGGAT AGGGTGGCGT TTAGATGTAG C ^ 
CGGTGCTTGG GATGGACGGG ™ TTAGATGTAG CATGATAAAT ATGTGGCCGG 

CATGGACGGG GTGGTTATTA TGAATGTAAG GTTTACTGGG 
GCGGTACGGT TTTCCTGGGG AATACCAACC TTATCCTACA CGGTGTAAGC 
TTAACAATAC CTGTGTGGAA GCCTGGACCG ATGTAAGGGT ^ 7"""" 
GCTGCTGGAA GGGGGTGGTG TGTCGCCCCA AAAGCAGGGC ^Z^"" 
TTGAAAGG7G TACCTTrrr-r ^r- ^AATTAAG AAATGCCTCT 

GCTCCGACTG TGGTTGCTTC Z^ "CAATGTGG 
— C« CTGcZZg l MM ™ aC — CATAACATGG 

ACCTG'TGAA ^ ™"CGGAC ««AGTGTC 

. ~ :™; ~ c ~~ — ~ 

ACCAATGCAA TTTGAGTCAC ^ ~ — « 
ACCTGAACGG GGTGTTTGAC ATGACCATGA A^ ^ ^ 
■ ~= ~C TGCGAGTGTG GCGGTAAACA ^ 

TGCTGGATGT GACCGAGGAG CTGAGGCCCG ATCACTTGGT GCTGGCCTGC ^ 

AGTTTGGCTC TAGCGATGAA GATACAGATT GAGGTACTGA AATGTGTGGG 

GGGTGGGAAA GAATATATAA GGTGGGGGTC TTATGTAGTT TTGTATCTGT ^ 

CCGCCGCCGC CATGArrar*>- » r GTATCTGT TTTGCAGCAG 

CGCGCA7GCC ^ ^ TATTTGACAA 

GCCCCGTGCT Z~ ~ 
GCCCGCAAAC TCTACTACGT TGACCTACGA GACCGTGTCT GGAAGGCGGT 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
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TGGAGACTGP. 


AGCCTCC^CC 

/1VJL V_ I L V- k-V- L 


jlLuv, i * V-Mvj 


CCGCTGCAGC 


CACCGCCCGC 


GGGATTGTGA 


5340 


CTGACTTTGr 

v. ivjn^ i j. l vj v. 


1 1 A v_ v_ 1 vJ/iVJ V, 


LLUL 1 lvjv_AA 


GCAGTGCAGC 


TTCCCGTTCA 


TCCGCCCGCG 


5400 


ATGACAAGTT 


GACGGPTP"*"** 

VJrtLVJVJL Ik. 4 . 


TTnnp/Apa &t 

1 l vjvjLML HA 1 


TGGATTCTTT 


GACCCGGGAA 


CTTAATGTCG 


5460 


TTTCTCAGCA 


GCTGTTGGAT 


CTGCGCCS.Gr 

L ivJLLiLLrtvjL 


Aljvj I i X CTGC 


CCTGAAGGCT 


TCCTCCCCTC 


5520 


CCAATGCGGT 


TTAAAACATA 


AATAAAAAAP 


L/AvjAL i v_ 1 L? i 


TTGGATTTGG 


ATCAAGCAAG 


5580 


TGTCTTGCTG 


TCTTTATTTA 


GGGGTTTTGP 

vwuvj A X X X VJL 


UV-\jLvJ\.VjVj i. A 


GbLLCGGGAC 


CAGCGGTCTC 


5640 


GGTCGTTGAG 


GGTCCTGTGT 


ATTTTTTPPA 

" a 1111 1LLH 


vj^jAv. vj 1 Vjvj I A 


AAGGTGACTC 


TGGATGTTCA 


5700 


GATACATGGG 


CATAAGCCCG 


TCTPTPPpPT 

1 L 1 L X LJ L7V_3\J 1 


VjvjAvjvj 1 A vjl. A 


LLACTGCAGA 


GCTTCATGCT 


5760 


GCGGGGTGGT 


GTTGTAGATG 


ATPPaPTPPT 
X LLnu 1 L vj 1 


app app 

M vj L Avj Vj Avj v_ vj 


LTGGGCGTGG 


TGCCTAAAAA 


5820 


TGTCTTTCAG 


TAGCAAGCTG 


ATTnppannn 

r\ l 1ULV.MVJUU 


p.pap.p.p^ptt 
vjLAvjv3v_v_l 1 1 


vjGTGTAAGTG 


TTTACAAAGC 


5880 


GGTTAAGCTG 


GGATGGGTGP 


ATA PPTPPPP 

M X >\L Ll 1 vj vjvjvj 


A *T* A Tp ApJi TV* 
A 1 A 1 u AvjA I vj 


CATCTTGGAC 


TGTATTTTTA 


5940 


GGTTGGCTAT 


GTTCCCAGCC 

VJ 4. I V— V> Vw VJ \_ V_ 


ATATPPPTPP 

«lr\XLV_L ILL 


PPPriTTPBT 
vjvjvjvjA 1 1 LA 1 


GTTGTGCAGA 


ACCACCAGCA 


6000 


CAGTGTATCC 


GGTGCACTTG 

WW A WV.^1V_ X 1 VJ 


GG AAA TTTPT 


LAI vj 1 AvjL i r 


AGAAGGAAAT 


GCGTGGAAGA 


6060 


ACTTGGAGAC 


GCCCTTGTGA 


PPTPP AAPfiT 
LL I LLMMLiM 1 


TTTPP''! TPPA 

111 ULA 1 vjC-A 


TTCGTCCATA 


ATGATGGCAA 


6120 


TGGGCCCACG 


GGCGGCGGCC 

VJ VJ L VJ VJ V_ VJ VJ L L 


l vjLjvjL vjMMvjAV 


IA1 TTCTGGG 


ATCACTAACG 


TCATAGTTGT 


6180 


GTTCCAGGAT 


nAn ATPPTPn 

VJrtvJJA X v- vJ 1 LM 


T A /T^ f™ 1 7A TTT 
lnyuLv-Ai 1 i 


1 TACAAAGCG 


CGGGCGGAGG 


GTGCCAGACT 


6240 


GCGGTATAAT 

VJ V. VJ VJ A 1. /"Vf! X 


G G T T P P A T P P 
vjvj 1 l\_Lrtlv,L 


vJvjU L l Alivjvjli 


CGTAGTTACC 


CTCACAGATT 


TGCATTTCCC 


6300 


ACGCTTTGAG 

n\.iju i i j. vj^\vj 




vjovjM. 1 1„ A 1 vj 1 


LTACCTGCGG 


GGCGATGAAG 


AAAACGGTTT 


6360 


CCGGGGTAGG 

V— VJ VJVJVJ 1 rtVJ VJ 


VJVJrtVJrt 1 LrtU L 


1 vj Vjvj H/\b AAA 


GlAGGTTCCT 


GAGCAGCTGC 


GACTTACCGC 


6420 


AGCCGGTGGG 


pppp.ta aatp 


AlAll 1A1 I A 


LLGGGTGCAA 


CTGGTAGTTA 


AGAGAGCTGC 


6480 


iVVJ \, 1UV.V. VJ X l 


nlLLL 1 VJrtvj L 


A vj vj vJ vj vj \j L L A 


CTTCGTTAAG 


CATGTCCCTG 


ACTCGCATGT 


6540 


TTTCCCTGAC 

A 1. A V_ V_ V_ A VJrtV_ 


L f\J-\J-\ ILLuLL 


JPA A CC CC 
rtvj^vrtbvjLuv. 1 


LGCCGCCCAG 


CGATAGCAGT 


TCTTGCAAGG 


6600 


AAGCAAAGTT 


ill LrtMLuu 1 


1 I vjALjALv_vj I 


CCGCCGTAGG 


CATGCTTTTG 


AGCGTTTGAC 


6660 


CAAGPAGTTP 

V_,fVrAVJLrtVJ 1 X L 


CT^GGCGGTCr 
LrtvjvjLvjvj ILL 


v.Av_Avjk_ 1 l vjVj 


TCACCTGCTC 


TACGGCATCT 


CGATCCAGCA 


6720 


TATCTCCTCG 

A A L A LL X L VJ 


TTTPP.PnP.PT 
111 L vjLvjvjvj i 


1 vjuvjvjLvjvjL 1 


ITCGCTGTAC 


GGCAGTAGTC 


GGTGCTCGTC 


6780 


CAGACGGGPP 

v_r^\j>^k\_ VJVJVJ\_ L 


AP G G T P A TP T 
rVvjvjvj i Lrt 1 Vj i 


Llll v_v_Av,Vjvj 


GCGCAGGGTC 


CTCGTCAGCG 


TAGTCTGGGT 


6840 


CACGGTG7VAG 


GGGTGCCXCVC 

vJVJVJ i VJL vjL X L 


LuobL lvjL-Vjv. 


GCTGGCCAGG 


GTGCGCTTGA 


GGCTGGTCCT 


6900 


GCTGGTGCTG 

V—J V X VJVJ A VJ V_ X VJ 


r\f\\J L VJ L 1 vjL L 


vjvjIlI iv-vjv.V_ 


CTGCGCGTCG 


GCCAGGTAGC 


ATTTG AC CAT 


6960 


GGTGTCATAG 

VJ VJ 1 u 1 L -rt X /"\VJ 


Tpr apppppt 

L LLMuLLLL 1 


v.v»vjv_vjvjLvj i Lj 


GLCCTTGGCG 


CGCAGCTTGC 


CCTTGGAGGA 


7020 


GGCGCCGCAC 


n apppppapt 

vjAvjvjovjl AvJ I 


vjLAvjAlI 1 TT 


GAGGGCGTAG 


AGCTTGGGCG 


CGAGAAATAC 


7080 


CGATTPPGGG 

Lvjrtl 1 v.\.VJVJVJ 


g ap t a pnp a *** 

OrtO 1 MvjvjLH. a 


l v»vjv_vjL LvjL A 


GGCCCCGCAG 


ACGGTCTCGC 


ATTCCACGAG 


7140 


CCAGGTGAGG 

ll/^vjvj i vjnvj^. 


tptppppptt 

X L 1 VJVJL L vj 1 I 


V-vJOvjvj i L AAA 


AACCAGGTTT 


CCCCCATGCT 


TTTTGATGCG 


7200 


TTTCTTACCT 


CTGGTTTCCA 


TGAGCCGGTG 


7CCACGCTCG 


GTGACGAAAA 


GGCTGTCCGT 


7260 


GTCCCCGTAT 


ACAGACTTGA 


GAGGCCTGTC 


CTCGAGCGGT 


GTTCCGCGGT 


CCTCCTCGTA 


7320 


TAGAAACTCG 


GACCACTCTG 


AGACAAAGGC 


TCGCGTCCAG 


GCCAGCACGA 


AGGAGGCTAA 


7380 
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GTGGGAGGGG TAGCGGTCGT TGTCCACTAG GGGGTCCACT CGCTCCAGGG TGTGAAGACA 74 40 

CATGTCGCCC TCTTCGGCAT CAAGGAAGGT GATTGGTTTG TAGGTGTAGG CCACGTGACC 7500 

GGGTGTTCCT GAAGGGGGGC TATAAAAGGG GGTGGGGGCG CGTTCGTCCT CACTCTCTTC 7 56 0 

CGCATCGCTG TCTGCGAGGG CCAGCTGTTG GGGTGAGTAC TCCCTCTGAA AAGCGGGCAT 7 620 

GAC7TCTGCG CTAAGATTGT CAGTTTCCAA AAACGAGGAG GATTTGATAT TCACCTGGCC 7680 

CGCGGTGATG CCTTTGAGGG TGGCCGCATC CATCTGGTCA GAAAAGACAA TCTTTTTGTT 774 0 

GTCAAGCTTG GTGGCAAACG ACCCGTAGAG GGCGTTGGAC AGCAACTTGG CGATGGAGCG 7800 

CAGGGTTTGG TTTTTGTCGC GATCGGCGCG CTCCTTGGCC GCGATGTTTA GCTGCACGTA 7860 

TTCGCGCGCA ACGCACCGCC ATTCGGGAAA GACGGTGGTG CGCTCGTCGG GCACCAGGTG 7920 

CACGCGCCAA CCGCGGTTGT GCAGGGTGAC AAGGTCAACG CTGGTGGCTA CCTCTCCGCG 7980 

TAGGCGCTCG TTGGTCCAGC AGAGGCGGCC GCCCTTGCGC GAGCAGAATG GCGGTAGGGG 804 0 

GTCTAGCTGC GTCTCGTCCG GGGGGTCTGC GTCCACGGTA AAGACCCCGG GCAGCAGGCG 8100 

CGCGTCGAAG TAGTCTATCT TGCATCCTTG CAAGTCTAGC GCCTGCTGCC ATGCGCGGGC 8160 

GGCAAGCGCG CGCTCGTATG GGTTGAGTGG GGGACCCCAT GGCATGGGGT GGGTGAGCGC 8220 

GGAGGCGTAC ATGCCGCAAA TGTCGTAAAC GTAGAGGGGC TCTCTGAGTA TTCCAAGATA 8280 

TGTAGGGTAG CATCTTCCAC CGCGGATGCT GGCGCGCACG TAATCGTATA GTTCGTGCGA 834 0 

GGGAGCGAGG AGGTCGGGAC CGAGGTTGCT ACGGGCGGGC TGCTCTGCTC GGAAGACTAT 8400 

CTGCCTGAAG ATGGCATGTG AGTTGGATGA TATGGTTGGA CGCTGGAAGA CGTTGAAGCT 84 60 

GGCGTCTGTG AGACCTACCG CGTCACGCAC GAAGGAGGCG TAGGAGTCGC GCAGCTTGTT 852 0 

GACCAGCTCG GCGGTGACCT GCACGTCTAG GGCGCAGTAG TCCAGGGTTT CCTTGATGAT 8580 

GTCATACTTA TCCTGTCCCT TTTTTTTCCA CAGCTCGCGG TTGAGGACAA ACTCTTCGCG 864 0 

GTCTTTCCAG TACTCTTGGA TCGGAAACCC GTCGGCCTCC GAACGGTAAG AGCCTAGCAT 8700 

GTAGAACTGG TTGACGGCCT GGTAGGCGCA GCATCCCTTT TCTACGGGTA GCGCGTATGC 8760 

CTGCGCGGCC TTCCGGAGCG AGGTGTGGGT GAGCGCAAAG GTGTCCCTGA CCATGACTTT 8820 

GAGGTACTGG TATTTGAAGT CAGTGTCGTC GCATCCGCCC TGCTCCCAGA GCAAAAAGTC 8880 

CGTGCGCTTT TTGGAACGCG GATTTGGCAG GGCGAAGGTG ACATCGTTGA AGAGTATCTT 8940 

TCCCGCGCGA GGCATAAAGT TGCGTGTGAT GCGGAAGGGT CCCGGCACCT CGGAACGGTT 9000 

GTTAATTACC TGGGCGGCGA GCACGATCTC GTCAAAGCCG TTGATGTTGT GGCCCACAAT 9060 

GTAAAGTTCC AAGAAGCGCG GGATGCCCTT GATGGAAGGC AATTTTTTAA GTTCCTCGTA 9120 

GGTGAGCTCT TCAGGGGAGC TGAGCCCGTG CTCTGAAAGG GCCCAGTCTG CAAGATGAGG 9180 

GTTGGAAGCG ACGAATGAGC TCCACAGGTC ACGGGCCATT AGCATTTGCA GGTGGTCGCG 9240 

AAAGGTCCTA AACTGGCGAC CTATGGCCAT TTTTTCTGGG GTGATGCAGT AGAAGGTAAG 9300 

CGGGTCTTGT TCCCAGCGGT CCCATCCAAG GTTCGCGGCT AGGTCTCGCG CGGCAGTCAC 9360 

TAGAGGCTCA TCTCCGCCGA ACTTCATGAC CAGCATGAAG GGCACGAGCT GCTTCCCAAA 9420 

GGCCCCCATC CAAGTATAGG TCTCTACATC GTAGGTGACA AAGAGACGCT CGGTGCGAGG 94 80 
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ATGCGAGCCG 

GTGGTGAAAG 
TGCGCAGTAC 
GCGCACAAGG 
TTCTACTTCG 
GACCACCACG 
GACAACATCG 
CGGGAGCTCC 
ATACCTAATT 
CCGCGGCGCG 
TGCATCTAAA 
GGGAGAGGGG 
AGGTTGCTGG 
AAGACGACGG 
TCGTTGACGG 
ATCTCGGCCA 
ACGGTGGCGG 
CCCTCGTTCC 
ACCTGCGCGA 
AAGAGGTAGT 
CGCAACGTGG 
TCCACGGCGA 
AGACGGATGA 
TCTTCTTCTT 
GGGGGAGGGG 
ATCATCTCCC 
CGCAGTTGGA 
GGCAGGGATA 
AGGGACCTGA 
CAGTCACAGT 
TTGTTTCTGG 
ATGGTCGACA 
ATGCCCCAGG 
TCTACCGGGA 
GCGGCGGCGG 



ATCGGGAAGA 
TAGAAGTGCC 
TGGCAGCGGT 
AAGCAGAGTG 
GCTGCTTGTG 
CCGCGCGAGG 
GGGAGATGGG 
TGCAGGTTTA 
TCCAGGGGCT 
ACTACGGTAC 
AGCGGTGACG 
GCAGGGGGAC 
CGAACGCGAC 
GGCCGGTGAG 
CGGCCTGGCG 
TGAAGTGCTG 
CGAGGTCGTT 
AGACGCGGCT 
GATTGAGCTC 
TGAGGGTGGT 
ATTCGTTGAT 
AGTTGAAAAA 
GCTGGGCGAC 
CAATCTGCTC 
GGACACGGCG 
CGCGGCGACG 
AGACGCCGCC 
GGGCGCTAAC 
GCGAGTCCGC 
GGCAAGGTAG 
CGGAGGTGCT 
GAAGCAGCAT 
CTTCGTTTTG 
CTTCTTCTTC 
AGTTTGGCCG 



AGTGGATCTC 
TGCGACGGGC 
GCACGGGCTG 
GGAATTTGAG 
CTTGACCGTC 
CCAAAGTCCA 
AGCTGTCCAT 
CCTCGGATAG 
GGTTGGTGGC 
CGCGGGGCGG 
CGGGCGAGCG 
GTCGGCGCCG 
GACGCGGCGG 
CTTGAGCGTG 
CAAAATCTCG 
GATCTCTTCC 
GGAAATGCGG 
GTAGACCACG 
CACGTGCCGG 
GGGGGTGTGT 
ATCCCCCAAG 
CTGGGAGTTG 
AGTGTCGCGC 
TTCCATAAGG 
GCGACGAGGG 
GCGCATGGTG 
CGTCATGTCC 
GATGCATCTC 
ATCGACCGGA 
GCTGAGCACC 
GCTGATGATG 
GTCCTTGGGT 
ACATCGGCGC 
TCCTTCCTCT 
TAGGTGGCGC 



CCGCCACCAA 
CGAACACTCG 
TACATCCTGC 
CCCCTCGCCT 
TGGCTGCTCG 
GATGTCCGCG 
GGTCTGGAGC 
ACGGGTCAGG 
GGCGTCGATG 
GCGGTGGGCC 
CCCGGAGGTA 
CGCGCGGGCA 
TTGATCTCCT 
AAAGAGAGTT 
TGCACGTCTC 
TCCTGGAGAT 
GCCATGAGCT 
CCCCCTTCGG 
GCGAAGACGG 
TCTGCCAGGA 
GCCTGAAGGC 
CGCGCCGACA 
ACCTCGCGGT 
GCCTCCGCTT 
CGCACCGGGA 
TCGGTGACGG 
CGGTTATGGG 
AACAATTGTT 
TCGGAAAACC 
GTGGCGGGCG 
TAATTAAAGT 
CCGGCCTGCT 
AGGTCTTTGT 
TGTCCTGCAT 
GCTCTTCCTC 



TTGGAGGAGT 
TGCTGGCTTT 
ACGAGGTTGA 
GGCGGGTTTG 
AGGGGAGTTA 
CGCGGCGGTC 
TCGCGCGGCG 
GCGCGGGCTA 
GCTTGCAAGA 
GCGGGGGTGT 
GGGGGGGCTC 
GGAGGTGGTG 
GAATGTGGCG 
CGACAGAATC 
CTGAGTTGTC 
CTCCGCGTCC 
GCGAGAAGGC 
CATCGCGGGC 
CGTAGTTTCG 
AGAAGTACAT 
GCTCCATGGC 
CGGTTAAGTC 
CAAAGGGTAC 
CTTCTTCTTC 
GGCGGTCGAC 
CGCGGCCGTT 
TTGGCGGGGG 
GTGTAGGTAC 
TCTCGAGAAA 
GCAGCGGGCG 
AGGCGGTCTT 
GAATGCGCAG 
AGTAGTCTTG 
CTCTTGCATC 
CCATGCGTGT 



PCTAJS97/I9541 

GGCTATTGAT 9 54 0 

TGTAAAAACG 9600 

CCTGACGACC 9660 

GCTGGTGGTC 9 720 

CGGTGGATCG 9780 

GGAGCTTGAT 984 0 

TCAGGTCAGG 9900 

GATCCAGGTG 9960 

GGCCGCATCC 10020 

CCTTGGATGA 10080 

CGGACCCGCC 10140 

CTGCGCGCGT 10200 

CCTCTGCGTG 10 260 

AATTTCGGTG 10 3 20 

TTGATAGGCG 10380 

GGCTCGCTCC 104 4 0 

GTTGAGGCCT 10 500 

GCGCATGACC 10 560 

CAGGCGCTGA 106 2 0 

AACCCAGCGT 1068 0 

CTCGTAGAAG 1074 0 

CTCCTCCAGA 108 00 

AGGGGCCTCT 10860 

TGGCGGCGGT 10920 

AAAGCGCTCG 10980 

CTCGCGGGGG 1104 0 

GCTGCCATGC 11100 

TCCGCCGCCG 11160 

GGCGTCTAAC 11220 

GCGGTCGGGG 11280 

GAGACGGCGG 11340 

GCGGTCGGCC 114 00 

CATGAGCCTT 114 6 0 

TATCGCTGCG 11520 

GACCCCGAAG 11580 
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GGAGGCAGCT 


GGGGCCGGAC 


CTGGGCTGGC 


GGTGGCACCP 




UCAACGTCGG 


13*74 0 


CGGCGTGGAG 


GAATATGACG 


AGGACGATGA 


GTACGAGCCA 


\Jn\j\jM\_\jvjL\J 


v-Aj I AL i AAGv_ 


13900 


GGTGATGTTT 


CTGATCAGAT 


GATGCAAGAC 


GCAACGGACC 


cggcggtgpp 


vjvjLvjvjLvjL I tj 


13 860 


CAGAGCCAGC 


CGTCCGGCCT 


TAACTCCACG 


GACGAC7GGC 


GCCAGGTCAT 


PP APPP*"* ATr 

vjvjrtv_v_ \jv_H I v_ 


13 920 


ATGTCGCTGA 


CTGCGCGCAA 


TCCTGACGCG 


TTCCGGCAGC 


agcpgpaggp 


V-/VAv_v_vjvjv_ A (_ 


13 980 


TCCGCAATTC 


TGGAAGCGGT 


GGTCCCGGCG 


CGCGCAAACC 


ccapgpapga 

v— v- v_ VJ V, r\V_ 


vj>-\>-\vjvj l vj L. i vj 


14 04 0 


GCGATCGTAA 


ACGCGCTGGC 


CGAAAACAGG 


GCCATCCGGC 


CPGAPGAGGP 


p* n p^ p"tp* p"r p* 

v_Vj\jv_V_ Ivjvj l v_ 


14100 


TACGACGCGC 


TGCTTCAGCG 


CGTGGCTCGT 


TACAACAGCG 


GCAACGTGCA 




1416 0 


GACCGGCTGG 


TGGGGGATGT 


GCGCGAGGCC 


GTGGCGCAGC 


GTGAGPGPGC 

VJ 1 VJ ./"V VJ V_ VJ V_. \J V_ 


PP AGPZiPP'A o 

vj \- e\ vj \_ rt Vj V_ i\ Vj 


14 220 


GGCAACCTGG 


GCTCCATGGT 


TGCACTAAAC 


GCCTTCCTGA 


GTAPAPAGPP 


v_ uLLRHL Vj 1 Vj 


14 28 0 


CCGCGGGGAC 


AGGAGGACTA 


CACCAACTTT 


GTGAGCGCAC 


TGCGGCTAAT 

X VV>\J\JV« X f\f^ X 


PPTG 21 PTr* a r 1 

vJ\J i uML i oAu 


14 3 4 0 


ACACCGCAAA 


GTGAGGTGTA 


CCAGTCTGGG 


CCAGACTATT 


TTTTCCAGAC 




14400 


GGCCTGCAGA 


CCGTAAACCT 


GAGCCAGGCT 


TTCAAAAACT 


TG C AGGGGPT 


VJ 1 VjvjvjVjvJvJ I Vj 


14 46 0 


CGGGCTCCCA 


CAGGCGACCG 


CGCGACCGTG 


TCTAGCTTGC 


TGACGCCCAA 


PTPPPPPPTP 
V— i V- vj Luv, v_ 1 VJ 


14 52 0 


TTGCTGCTGC 


TAATAGCGCC 


CTTCACGGAC 


AGTGGCAGCG 


tgtpcpggga 


v- H.v_ M 1 Av_ v_ 1 jH 


14 580 


GGTCACTTGC 


TGACACTGTA 


CCGCGAGGCC 


ATAGGTCAGG 


CGCATGTPGA 

\.VJV.V\ X VJ 1 VjVJrt 




1464 0 


TTCCAGGAGA 


TTACAAGTGT 


CAGCCGCGCG 


CTGGGGCAGG 


apgapapppp 


v_Avjv_v_ IvjvjAvj 


14 700 


GCAACCCTAA 


ACTACCTGCT 


GACCAACCGG 


CGGCAGAAGA 


TCCCCTPGTT 


PP AP ariTTTi 

vjv_MV_M.vj 1 I 1 A 


1 4 76 0 


AACAGCGAGG 


AGGAGCGCAT 


TTTGCGCTAC 


GTGPAGPAGA 


vjv,\j 1 un\Jv.L I 


1 jH_H.v_ v, i IjA i b 


1482 0 


CGCGACGGGG 


TAACGCCCAG 


CGTGGCGCTG 


GACATGACCG 


CGPGPAAPAT 


VJVJ M/\V_ V_ vJVJVJ V- 


14 88 0 


ATGTATGCCT 


CAAACCGGCC 


GTTTATCAAC 


CGCCTAATGG 


ACTAPTTPPA 


Trr.ppppppr 

1 V-VJV-VJV_\JvJV_V_ 


14 940 


GCCGTGAACC 


CCGAGTATTT 


CACCAATGCC 


ATCTTGAACC 


v_- VJ V_, rA.V_ XVjVjV- 1. 


APPPf^PTTT'T 
rtLv-vjv_LL\_L x 


i c n n n 


GGTTTCTACA 


CCGGGGGATT 


CGAGGTGCCC 


GAGGGTAACG 


ATGGATTPPT 


v_ 1 vjvjvjALvjAL 


1 hUb U 


ATAGACGACA 


GCGTGTTTTC . 


CCCGCAACCG 


C AG AC CCTGC 


TAGZlPTTPPIi 
x rtortvj 1 x vj v_ ^\ 


M.v» AVj LLiL vj Avj 


15120 


CAGGCAGAGG 


CGGCGCTGCG 


AAAGGAAAGC 


TTC CGCAGGC 


PAAGPAGPTT 


PTPPP 7\ TPT A 

vj i l_ v_vjA ILIA 


1 D IS 0 


GGCGCTGCGG 


CCCCGCGGTC 


AGATGCTAGT 


AGCCCATTTC 


PAAGPTTGAT 


APPPTPTfTT 
rtvjvjvj 1 v» I v_, 1 I 


1 j ^ 4 U 


ACCAGCACTC 


GCACCACCCG 


CCCGCGCCTG 


CTGGGCGAGG 


AGGAGTAPPT 

r^vj/vjnvj x ^\v_ v_ X 


A A AP A aPTPP 


1 j J U U 


CTGCTGCAGC 


CGCAGCGCGA 


AAAAAACCTG 


CCTCCGGCAT 


TTCCCAACAA 


PPPP AT APAP 

V_VJVJVJ^-V 1 MVJrtVJ 




AGCCTAGTGG 


ACAAGATGAG 


TAGATGGAAG 


ACGTACGCGC 


AGGAGPAPAG 


ppapptppph 

vjvj>\v-.VJ 1 vjLLn 


1 J 4 4 U 


GGCCCGCGCC 


CGCCCACCCG 


TCGTCAAAGG 


CACGACCGTC 


AGCGGGGTCT 


GGTGTPPPAP 

VJVJ i VJ X VJVjVJrtVJ 




GACGATGACT 


CGGCAGACGA 


CAGCAGCGTC 


CTGGATTTGG 


GAGGGAGTGG 

vjnvjvjvjnvj x vjvj 


P A A P PPPTTT 
v_rt>\\_. V_V_VJ ill 


ID34U 


GCGCACCTTC 


GCCCCAGGCT 


GGGGAGAATG 


TTTTAAAAAA 


AAAAAAGPAT 


G A TG P A A A TA T 


1 jOu v 


AAAAAAC aCA 


CCAAGGCCAT 


GGCACCGAGC 


GTTGGTTTTC 


TTGTATTCCC 


CTTAGTATGC 


15660 


GGCGCGCGGC 


GATGTATGAG 


GAAGGTCCTC 


CTCCCTCCTA 


CGAGAGTGTG 


GTGAGCGCGG 


15720 


CGCCAGTGGC 


GGCGGCGCTG 


GGTTCTCCCT 


TCGATGCTCC 


CCTGGACCCG 


CCGTTTGTGC 


15780 
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CTCCGCCCTA CCTGCGGCCT ACCGGGGGGA GAAACAGCAT ^ PCT ^97/1954, 

— «™ „ :=: - 

■ AAG.AGGA CAACGACGAC AGCAAC^C TGACGAGGG, CATTCAAAAC AATGACTACA 
GCCGGGGGGA GGGAAGCACA GAGACCATGA A TCTO ACGA CCGG.GGGAC TGGGG^ 
CXOAAAAC CA,™ ACCAACATGC CAAATGTGAA GGAG^ATG ™ ^ 
™ C =C 3C« „C T TGCCTACTAA GGACAATCAG GXOG ^ 

::::::: :::r c ~° 6to — ~ 

. TATGAA CAACGCGATC GTGGAGCACT ACTTGAAAGT GGGCAGACAG AACGGGGTT" 

~» CATCGGGGTA AAGTTTGACA GCGGGAAGT, CAGACTGGGG ^ 
TCACTGGTCT «C CT GGGGTATATA CAAACGAAGC C^GA.CCA GACATCATTT 

. -GCCAGG ATGCGGGGTG GA^CACGC ACAGCCGCG, GAGCAACTTG TTGGGCATCC 
AAGGGGCA ACGC„CCAG GAGGGCTTTA GGA.CACC.A CGATGATCTG GAGGGTGGTA 

' 4mTTGMT ACC ™ ~ GACACCGAAC 

ACGCG ' GGCAGC " CA 5 " GTG0CAC ™™ 

.CGGGGCAGC CGCGGCAATG CAGCCGG1GG AGGACATGAA CGATCATGCC ATTCGCGGCG 

ACACCTTTGC CACACGGGCT GAGGAGAAGG GCGC.GAGGC CGAAGCAGGG GCCGAAGCTG 

CCGGCGGCGC TGCGCAACCC GAGGTCGAGA AGCCTCAGAA GAAACCGGTG ATCAAACCCC 

TGACAGAGGA CAGCAAGAAA CGCAGTTACA ACCTAATAAG CAATGACAGG ACCX.CACGC 

AG.ACCGCAG Cr m GCAXAGAAG, ACGGGGACGC T GAGAGCGGA A T CGGC T GA T 

GGACCCTGCT ^GGACCG, GACGTAACCT GCGGCTCGGA GCAGGTCTAC „ TGC 

CAGACATGAT GCAAGAGCCC GXGACG^C GG.CCACGCG CCAGATCAGC AACTTTCCGG 

TGGTGGGCGC CGAGCTGTTG CCCGTGCACT CCAAGAGCTT CTACAACGAG CAGGCGGTCT 

ACTCCCAACT CATCCGCCAG TTTACCTCTC TGACCCACGT GTTCAATCGC TTTCCCGAGA 

ACCAGATTTT GGCGGGCCCG CCAGCCCCCA CCATCACGAC CGTCAGTGAA AACGTTCCTG 

■CTGTGAGAGA «ACG CTACCGCTGC GCAACAGCAT CGGAGGAGTC CAGCGAGTGA 

CCATTACTGA CGCCAGACGC CGCAGGXGCC CCTACGTTTA CAAGGCCCTG GGCATAGTCT 

CGCCGCGCGT CG„C CGCACT^ GAGGAAGCAT G T GCA TOTT AXA.CGCGCA 

GCAATAACAC AGGCTGGGGC C^CC CAAGCAAGAT GTTTGGCGGG GCGAAGAAGC 

CCTGCGACCA AGACCCAGTG CGCGTGCGCG GGCACTACCG CGCGCCCTGG GGCGGGCACA 

AACGCGGCCG CAC« ACGAGGGXGG ATGACGCCAT CGACGCGGTG GTGGAGGAGG 

- CGCGCAACTA CACGCGCACG CCGCCACCAG TGTCCACAGT GGACGCGGCC ATTCAGACCG 

1GGTGCGCGG AGCCCGGCGC TATGCTAAAA TGAAGAGACG GCGGAGGCGC CTAGCACGTC 

GCCACCGCCG CCGACCCGGC ACTGCCGCCC AACGCGCGGC GGCGGCCCTG CTTAACCGCG 

CACGTGGGAG CGGCCGACGG GCGGCCATGC GGGCCGCTCG AAGGCTGGGG GCGGGTATTG 

TGACTGTGCC GGCGAGGTGC AGGCGACGAG CGGCCGCCGC AGCAGCCGCG GCCATTAGTG 
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CTATGAC7CA 


GGGTCGCAGG 


GGCAACGTGT 


GCGTGCCCGT 


GCGCACCCGC 


CCCCCGCGCA 


CGTACTGTTG 


TATGTATCCA 


GCGGCGGCGG 


TCAAAGAAGA 


GATGCTCCAG 


GTCATCGCGC 


AGCAGGATTA 


CAAGCCCCGA 


AAGCTAAAGC 


ATGAACTTGA 


CGACGAGGTG 


GAACTGCTGC 


GGAAAGGTCG 


ACGCGTAAAA 


CGTGTTTTGC 
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ACGCTGGACC TGCCTCCCCC CGCCGACACC CAGCAGAAAC CTGTGCTGCC AGGCCCGACC 
GCCGTTGTTG TAACCCGTCC TAGCCGCGCG TCCCTGCGCC GCGCCGCCAG CGGTCCGCGA 
TCGTTGCGGC CCGTAGCCAG TGGCAACTGG CAAAGCACAC TGAACAGCAT CGTGGGTCTG 
GGGGTGCAAT CCCTGAAGCG CCGACGATGC TTCTGAATAG CTAACGTGTC GTATGTGTGT 
CATGTATGCG TCCATGTCGC CGCCAGAGGA GCTGCTGAGC CGCCGCGCGC CCGCTTTCCA 
AGATGGCTAC CCCTTCGATG ATGCCGCAGT GGTCTTACAT GCACATCTCG GGCCAGGACG 
. CCTCGGAGTA CCTGAGCCCC GGGCTGGTGC AGTTTGCCCG CGCCACCGAG ACGTACTTCA 
GCCTGAATAA CAAGTTTAGA AACCCCACGG TGGCGCCTAC GCACGACGTG ACCACAGACC 
GGTCCCAGCG TTTGACGCTG CGGTTCATCC CTGTGGACCG TGAGGATACT GCGTACTCGT 
ACAAGGCGCG GTTCACCCTA GCTGTGGGTG ATAACCGTGT GCTGGACATG GCTTCCACGT 
ACTTTGACAT CCGCGGCGTG CTGGACAGGG GCCCTACTTT TAAGCCCTAC TCTGGCACTG 
CCTACAACGC CGTGGCTCCC AAGGGTGCCC CAAATCCTTG CGAATGGGAT GAAGCTGCTA 20760 
CTGCTCTTGA AATAAACCTA GAAGAAGAGG ACGATGACAA CGAAGACGAA GTAGACGAGC 20820 
AAGCTGAGCA GCAAAAAACT CACGTATTTG GGCAGGCGCC TTATTCTGGT ATAAATATTA 
CAAAGGAGGG TATTCAAATA GGTGTCGAAG GTCAAACACC TAAATATGCC GATAAAACAT 
TTCAACCTGA ACCTCAAATA GGAGAATCTC AGTGGTACGA AACTGAAATT AATCATGCAG 
CTGGGAGAGT CCTTAAAAAG ACTACCCCAA TGAAACCATG TTACGGTTCA TATGCAAAAC 
CCACAAATGA AAATGGAGGG CAAGGCATTC TTGTAAAGCA ACAAAATGGA AAG CT AG AAA 
GTCAAGTGGA AATGCAATTT TTCTCAACTA CTGAGGCGAC CGCAGGCAAT GGTGATAACT 
TGACTCCTAA AGTGGTATTG TACAGTGAAG ATGTAGATAT AGAAACCCCA GACACTCATA 
TTTCTTACAT GGCCACTATT AAGGAAGGTA ACTCACGAGA ACTAATGGGC CAACAATCTA 21300 
TGCCCAACAG GCCTAATTAC ATTGCTTTTA GGGACAATTT TATTGGTCTA ATGTATTACA 21360 
ACAGCACGGG TAATATGGGT GTTCTGGCGG GCCAAGCATC GCAGTTGAAT GCTGTTGTAG 21420 
ATTTGCAAGA CAGAAACACA GAGCTTTCAT ACCAGCTTTT GCTTGATTCC ATTGGTGATA 21480 
GAACCAGGTA CTTTTCTATG TGGAATCAGG CTGTTGACAG CTATGATCCA GATGTTAGAA 21540 
TTATTGAAAA TCATGGAACT GAAGATGAAC TTCCAAATTA CTGCTTTCCA CTGGGAGGTG 21600 
TGATTAATAC AGAGACTCTT ACCAAGGTAA AACCTAAAAC AGGTCAGGAA AATGGATGGG 21660 
. AAAAAGATGC TACAGAATTT TCAGATAAAA ATGAAATAAG AGTTGGAAAT AATTTTGCCA 21720 
TGGAAATCAA TCTAAATGCC AACCTGTGGA GAAATTTCCT GTACTCCAAC ATAGCGCTGT 21780 
ATTTGCCCGA CAAGCTAAAG TACAGTCCTT CCAACGTAAA AATTTCTGAT AACCCAAACA 21840 
CCTACGACTA CATGAACAAG CGAGTGGTGG CTCCCGGGTT AGTGGACTGC TACATTAACC 21900 
TTGGAGCACG CTGGTCCCTT GACTATATGG ACAACGTCAA CCCATTTAAC CACCACCGCA 21960 
ATGCTGGCCT GCGCTACCGC TCAATGTTGC TGGGCAATGG TCGCTATGTG CCCTTCCACA 22020 
TCCAGGTGCC TCAGAAGTTC TTTGCCATTA AAAACCTCCT TCTCCTGCCG GG CTCAT ACA 22080 
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ZOuOU 


TPPTTPPPPP 
ILL I ILuLLL 




Lnl 1HL 


*5 a "5 n rt 


AGCAGCGGCC 


28260 


ATCCACAGCG 


28320 


ATCGACCCGC 


28380 
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OAGCTTAGAA ACAGGAT77T tGCCAGTCTG nTecr „. T ~ r .> 'CT/US,,,,,^ 
GAACAAGACC TGAAAATAAA ^ ' ' 

CAGAAAAGGG AAGATCAGCT TCGGCGCACG C^GGA T " CCCG " G """"" 2aS °° 

~~ ~ GGACTAGTTT ^ ^ — 
GTACGTCATC TCGAGCGGCC ACACCCGGCG CCAGCAGC-G - """"^ "«» 

AAGGAAATTC GCAGGCCG T A CATGTGGAGT TAG^G """"" 
OCTGCCCAAG AC.AG.CAAC CCGAATAAAC TACATGAGCG G ^ 

CGGGTCAACG GAATCGGCCC CCACCGAAAC CG^ ^ 

ACCACACCTC GTAATAACCT TAATCCCCGT AGTTGGCC "~ ^ 

AGTCCCGCTC GCACCACTGT GGTACTTCCC AGAGACGGCC AGGCCGAAGT 

AACTCAGGGG CGCAGCTTGC GGGCGGCTTT CGTCACAGGG ^ 

ATAACTCACC TGACAATCAG AGGGCGAGGT ATTCAGCTCA AG 
TCGCTTGGTC TGCGTCCGGA CGGGACATTT CAGATCGGCG GG 
ACGCCCGXC AGGCAATCCT AACTCTGCAG ACCTCGTC T 

~ — — =™ ~ : 

GGACCTCCCG GCCACTATCC GGATGAATTT ATTCCTAAC"" ^ 
GCGGACGGCT ACGACTGAAT GTTAATTAAG T ^ 

— atgaagcccg gaagacgg. T gaT^ r~ 

TGACACGCAA ACCGGTCCTC CAACTr-r "CAACCCCG TGTATCCATA 

TGGGTTTCAA GAGAGTCCCC CTGGGGTACT e ^ ^ 
CTCGAATGGC „GG TI GCGC TCAAAATGGG GAACGGCCTC ^ J"^ 
CCTTACCCC CAAAATGTAA CCACTGTGAG CCCACCTCTC A^ " 
AAACCTGGAA AXAXG.GCAC CCCTCACAGT TACGTCAGAA G ^ 
ATGGTCGCGG GCAACACACT CACCATGCAA ^ 
GCACGACTCC AAACTTAGCA TTGCCACCCA AGGACCCC TC a^ ^ 
AGGCCTGCAA ACATCAGGCC CGCTCACCAr ' ^^AG AAGGAAAGCT 

~ T CTAACTACTG CCACTGGTAG ^ 

TAGACAAAAT GGAAAACTAG GACTfifl ^GTTGAAAG AGCCCATTTA 

■ — ACGGTAGCAA CTGGTGCAGG ^ ^ " ^ 
TAAAGTTACT GGAGCCTTGG GTTTTGATTC AC. 

AGGAGTAAGG ATTO A TOT G AAAAG^ ^ ^™ "~ 
-CAAAAG CAACTAAATC ,AAGAG T AGG AGAG^ " ^ 

GAACTTGGAT ATTAACTACA ACAAAGGCGT TTACTTGTTT A ^ 
AAAGCTTGAG GTTAACCTAA GCACTGCCAA GGGGTTGATG TTTGACGCTA 
CATTAATGCA GGAGATGGGC TTGAATTTr. „ GAGCCATAGC 

TGAATTTGG TTCACCTAAT GGACCAAACA CAAATCCCCT 



28680 
29740 
23800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
2 9280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
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CAAAACAAAA 

AGGAACTGGG 

TAAGCTAACT 

AGATGCTAAA 

AGTTTTGGGT 

TATTATAAGA 

TTGGAACTTT 

TATGCCTAAC 

CAGTCAAGTT 

CGGTACACAG 

CTGGTCTGGC 

CATTGCCCAA 

AGAAAATTTG 

GATCACCGTA 

AACACACAGA 

TAACAGACAT 

CAGTGATATT 

GAGCCACAGG 

CCTACATGGG 

CGCGAATAAA 

CCTCAGCGAT 

GCACCCTGAT 

AAATCCCACA 

GGCCATCATA 

TAAACATTAC 

GATTAAACAT 

CTATACACTG 

CATGGATCAT 

ACTTCCTCAG 

CCTGAATCAG 

TTGTCAAAGT 

CTGTCTCAAA 

GTGTTGGTCG' 

AACCAGGTGC 

GTGTAGTAGT 



ATTGGCCATG 
CTTAGTTTTG 
TTGTGGACCA 
CTCACTTTGG 
GTTAAAGGCA 
TTTGACGAAA 
AGAAATGGAG 
CTATCAGCTT 
TACTTAAACG 
GAAACAGGAG 
CACAACTACA 
GAATAAAGAA 
AAGTCATTTT 
CCTTAATCAA 
GTACACAGTC 
ATTCTTAGGT 
AATAAACTCC 
CTGCTGTCCA 
GGTAGAGTCA 
CTGCTGCCGC 
GATTCGCACC 
CTCACTTAAA 
GTGCAAGGCG 
CCACAAGCGC 
CTCTTTTGGC 
GGCGCCATCC 
CAGGGAACCG 
CATGCTCGTC 
GATTACAAGC 
CGTAAATCCC 
GTTACATTCG 
AGGAGGTAGA 
TAGTGTCATG 
GGGCGTGACA 
TGTAGTATAT 



GCCTAGAATT 
ACAGCACAGG 
CACCAGCTCC 
TCTTAACAAA 
GTTTGGCTCC 
ATGGAGTGCT 
ATCTTAGTGA 
ATCCAAAATC 
GAGACAAAAC 
ACACAACTCC 
TTAATGAAAT 
TCGTTTGTGT 
TCATTCAGTA 
ACTCACAGAA 
CTTTGTCCCC 
GTTATATTCC 
CCGGGCAGCT 
ACTTGCGGTT 
TAATCGTGCA 
CGCCGCTCCG 
GCCCGGAGCA 
TCAGCACAGT 
CTGTATCCAA 
AGGTAGATTA 
ATGTTGTAAT 
ACCACCATCC 
GGACTGGAAC 
ATGATATCAA 
TCCTCCCGCG 
ACACTGCAGG 
GGCAGCAGCG 
CGATCCCTAG 
GCAAATGGAA 
AACAGATCTG 
CCACTCTCTC 



TGATTCAAAC 
TGCCATTACA 
ATCTCCTAAC 
ATGTGGCAGT 
AATATCTGGA 
ACTAAACAAT 
AGGCACAGCC 
TCACGGTAAA 
TAAACCTGTA 
AAGTGCATAC 
ATTTGCCACA 
TATGTTTCAA 
GTATAGCGCC 
CCCTAGTATT 
GGCTGGCCTT 
ACACGGTTTC 
CACTTAAGTT 
GCTTAACGGG 
TCAGGATAGG 
TCCTGCAGGA 
TAAGGCGCCT 
AACTGCAGCA 
AGCTCATGGC 
AGTGGCGACC 
TCACCACCTC 
TAAACCAGCT 
AATGACAGTG 
TGTTGGCACA 
TTAGAACCAT 
GAAGACCTCG 
GATGATCCTC 
TGTACGGAGT 
CGCCGGACGT 
GGTCTCCGGT 
AAAGCATCCA 



AAGGCTATGG 
GTAGGAAACA 
TGTAGACTAA 
CAAATACTTG 
ACAGTTCAAA 
TCCTTCCTGG 
TATACAAACG 
ACTGCCAAAA 
AGACTAACCA 
TCTATGTCAT 
TCGTCTTACA 
CGTGTTTATT 
ACCACCACAT 
CAACCTGCCA 
AAAAAGGATC 
CTGTCGAGCC 
GATGTCGCTG 
CGGCGAAGGA 
GCGGTGGTGC 
ATACAACATG 
TGTCCTGCGG 
CAGCACCAGA 
GGGGACCACA 
CCTCATAAAC 
CCGGTACGAT 
GGGCAAAACC 
GAGAGCGGAG 
ACACAGGGAC 
ATCCCAGGGA 
CACGTAACTC 
CAGTATGGTA 
GCGCCGAGAC 
AGTCATATTT 
CTCGCGGCTT 
GGCGCGCGGT 



PC77US97/19541 



TTGCTAAACT 


■3 n c j n 


AAAATAATGA 


JuDUU 


ATGGAGAGAA 


J vODu 


GTACAGTTTG 




GTGCTCATCT 


j \J 1 0 u 


AGCGAGAATA 


J v O H U 


CTGTTGGATT 




GTAACATTGT 




TTAGACTAAA 




TTTCATGGGA 


3 1080 


CTTTTTCATA 


3114 0 


TTTCAATTGG 


312 00 


AGCTTATACA 




CCTCGCTGCC 


313 20 


ATATCATGGG 


313 8 0 


AAACGCTGAT 




TCCAGCTGCT 


J i J u u 


GAAGTCCACG 


J X J 0 u 


TGCAGCAGPH 




GCAGTGGTCT 




GCACAGCAGC 


3174 0 


ATATTGTTCA 


318 00 


GAAGGCACGT 


3186 0 


ACGCTGGACA 


31920 


ATAAACCTCT 


31980 


TGCCCGCGGG 


3204 0 


GACTCGTAAC 


32100 


ACGTGCATAC 


32160 


AGAACCCATT 


32220 


ACGTTGTGCA 


32280 


GCGCGGGTTT 


3234 0 


AACGGAGATC 


32400 


GCTGAAGCAA 


32460 


AGATCGCTCT 


32520 


GGCTTCGGGT 


32580 
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^ CCC.CATG CGCCCCTCCC CTCATAACAT CCACCACCCC AGAATAAGCC 



ACACCCACCC AACCTACACA TTCCTTCTGC GAGTCACACA CGCGAGGACC GCGAAGAGCT 
GGAAGAACCA TGTTTTTT77 TTTATTCCAA AAGATTATCC AAAACCTCAA AATGAAGATC 
TATTAAGTGA ACCCGCTCCC CTCCGCTGGC GTGGTCAAAC TCTACAGCCA AAGAACAGAT 
AATGGCATTT GTAAGATGTT GCACAATGGC TTCCAAAAGG CAAACGGCCC TCACGTCCAA 
GTGGACGTAA AGGCTAAACC CTTCAGGGTG AATCTCCTCT ATAAACATTC CAGCACCTTC 
AACCATGCCC AAATAATTCT CATCTCGCCA CCTTCTCAAT ATATCTCTAA GCAAATCCC 
AATATTAAGT CCGCCCATTG TAAAAATCTG CTCCAGAGCG CCCTCCACCT TCAGCCTCAA 
GCAGCGAATC ATGATTGCAA A^cc, XCCTCACAGA 

GGAACATTAA CAAAAATACC GCGATCCCGT AGGTCCCTTC GCAGGGCCAG CTGAACATAA 
TCGTGCAGGT CTGCACGGAC CAGCGCGGCC ACTTCCCCGC CAGGAACCTT GACAAAAGAA 
CCCACACTCA TTATGACACG CATACTCGGA GCTATGCTAA CCAGCGTAGC CCCGATGTAA 
GCTTTGTTGC ATGGGCGGCG ATATAAAATG CAAGGTGCTG CTCAAAAAAT CAGGCAAAGC 
CTCGCGCAAA AAAGAAAGCA CATCGTAGTC ATGCTCATGC AGATAAAGGC AGGTAAGCTC 
CGGAACCACC ACAGAAAAAG ACACCATTTT TCTCTCAAAC ATGTCTGCGG GTTTCTGCAT 
AAACACAAAA TAAAATAACA AAAAAACATT TAAACATTAG AAGCCTGTCT TACAACAGGA 
AAAACAACCC TTATAAGCAT AAGACGGACT ACGGCCATGC CGGCGTGACC GTAAAAAAAC 
TGGTCACCGT GATTAAAAAG CACCACCGAC AGCTCCTCGG TCATGTCCGG AGTCATAATG 
TAAGACTCGG TAAACACATC AGGTTGATTC ATCGGTCAGT GCTAAAAAGC GACCGAAATA 
GCCCGGGGGA ATACATACCC GCAGGCGTAG AGACAACATT ACAGCCCCCA TAGGAGGTAT 
AACAAAATTA ATAGGAGAGA AAAACACATA AACACCTGAA AAACCCTCCT GCCTAGGCAA 
AATAGCACCC TCCCGCTCCA GAACAACATA CAGCGCTTCA CAGCGGCAGC CTAACAGTCA 
CCCTTACCAG TAAAAAAGAA AACCTATTAA AAAAACACCA CTCGACACGG CACCAGCTCA 
ATCAGTCACA GTGTAAAAAA GGGCCAAGTG CAGAGCGAGT ATATATAGGA CTAAAAAATG 
ACGTAACGGT TAAAGTCCAC AAAAAACACC CAGAAAACCG CACGCGAACC TACGCCCAGA 
AACGAAAGCC AAAAAACCCA CAACrTCCTC AAATCGTCAC TTCCGTTTTC CCACGTTACG 
TAACTTCCCA TTTTAAGAAA ACTACAATTC CCAACACATA CAAGTTACTC CGCCCTAAAA 
CCTACGTCAC CCGCCCCGTT GCCACGCCCC GCGCCACGTC ACAAACTCCA - CCCCCTCATT 
ATCATATTGG CTTCAATCCA AAATAAGGTA TATTATTGAT GAT 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 380 base pairs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

M ?^LI YPE: ° ther nuclei <= ^id 
(A) DESCRIPTION : /desc = " DNA " 

SEQUENCE DESCRIPTION: SEQ ID NO:S: 
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32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34303 
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CCAGGC77TA CACTTTATGC TTCCGGCTCG TATGTTGTGT GGAATTGTGA GCGGATAACA *q 

ATTTCACACA GGAAACAGCT ATGACCATGA TTACGCCAAG CGCGCAATTA ACCCTCACTA 120 

AAGGGAACAA AAGCTGGGTA CCGGGCCCCC CCTCGAGGTC GACGGTATCG ATAAGCTTAC 18 0 

GCGTGGCCTA GGCGGCCGAA TTCCTGCAGC CCGGGGGATC CACTAGTTCT AGAGCGGCCG 24 0 

CCACCGCGGC GCCTTAATTA ATACGTAAGC TCCAATTCGC CCTATAGTGA GTCGTATTAC 300 

GCGCGCTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG CGTTACCCAA 3 60 

CTTAATCGCC TTGCAGCACA 380 
(2) INFORMATION FOR SEQ ID NO : 6 : 

( i i SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xil SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

TGCCGCAGCA CCGGATGCAT C 21 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

GCGTCCGGAG GCTGCCATGC GGCAGGG 2 7 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1481 base pairs 

(B) TYPE: nucleic acid" 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 8 : 

GGCTGCCATG CGGCAGGGAT ACGGCGCTAA CGATGCATCT CAACAATTGT TGTGTAGGTA 6 0 

CTCCGCCGCC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC CTCTCGAGAA 12 0 

AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC CGTGGCGGGC GGCAGCGGGC 180 

GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC TGCTGATGAT GTAATTAAAG TAGGCGGTCT 24 0 

TGAGACGGCG GATGGTCGAC AGAAGCACCA TGTCCTTGGG TCCGGCCTGC TGAATGCGCA 300 

GGCGGTCGGC CATGCCCCAG GCTTCGTTTT GACATCGGCG CAGGTCTTTG TAGTAGTCTT 360 

GCATGAGCCT TTCTACCGGC ACTTCTTCTT CTCCTTCCTC TTGTCCTGCA TCTCTTGCAT 4 20 
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CTATCGCTGC GGCGGCGGCG GA G t T t ggcc r ^ PCT/US97/19541 

tcaccccgaa G .cc:c-r GGCTG " CCCTCTOT C ™- «- 

~» ™ — c : ~ i:r cg ~~ 

— CGTGTTGATG GTGTAAGTGC _ ~ = 
GGTGAC.CG CTGCGAGAGC TCCCTCTACC TGAGACCCGA CTAAGCCOC CACc 
CGTAGTCGTT GCAAG.CCC ACCAGGTACT GGTATCCCAC CAAAAA^ - G 
~ C " — G GGGCTCCGGG ~ ^ 
GGCGATGATA TCCGTAGA7G TACCTGGACA TCCAGGTGAT GCCGCCGCCC 
CGCGCGGAAA GTCGCGGACG CGGTTCCAGA TGTTGCGCAG CGGCAAAAAG TGCTCCATGG 
TCGGGACGCT CTGGCCGGTC AGGCGCGCGC AA^CAC GCTCTACCG, ^2 
~ «~»» ™ GGTGGATAAA ^ ^ 

a™ g TO gagccc cc T a T ccggc cgtccgccgt ga IC ca T gcg cJc" 

1:"" i" cm ~ ««- » — T ggc 

ccagg^ ™cc gctagctttt TO gccactg gccgcgcgca gcgtaagggg 

™ -CGA AAGCGAAAGC ATTAAGTGGC TCGCTCCCTG TAGCCGGAGG 
—G TCGCGGGACC CCCGGTTCGA GTCTCGGACC GGCCGGACTG J^Z 

c™ c : ::~ c ~ °™ — ~ 

CCCCTTTTTT GCTTTTCCCA GATGCATCCG GTGCTGCGGC A 
(2) INFORMATION FOR SEQ ID NO : 9 : 1481 

(i) SEQUENCE CHARACTERISTICS • 

A, LENGTH: 3 3 64 base paifs 
H TYPE: nucleic acid 
C STRANDEDNESS : double 
TOPOLOGY: linear 

(ii) MOLECULE TYPE • nf h„ n - 

«A) OESC R Ip P T E I0 ^ h ? d r e sc UC !^ D C N ^ id 
(XD SEQUENCE DESCRIPTION: SEQ ID N0 • 9 - 

ATCTGGAAAA 7^ ~C 
lc«r ATGACCCAAG AGGGGTTCTC AGAGGGTGGC GXG.GC.CCC 

TGGCAAGCCT ATGACATGGC CGGGGCCTGC CTCTCTCTGC CTC.CACCCT CAGTGGCTCC 
CATGAACTCC TTGCCCAATG GCATCTTTTT CCTGCGCTCC TOOTWT CCAGTCTCCC 
CTCAGCATTC CTTCCTCAGG GCC.CGCC, TCTm C OT CC TTCC ACAGCTGGCT 
C-GCACC, CAGATGTCAC AG.GCXC.C, CAGAGGAGGA AGGCACCATG TACCCTCTGT 
CCCAGG.A AGGGTTCAAT TTTTAAAAAT GGTTTTTTGT ^ m g ITO ^ 
™ TTCAAGACAG GGCTCCTCTG TGTAGTCCTA AC T „ AA^ 
GTAGACCAGG TCGACCTCGA AC TOT GAAA CCXGCCACGG A C C„ 

™: ::r Tcc ™= ~ — — 

AACAAACACA GAGGCAGTTT GAATCTGAGT GTATTTTGCA GCTC.CAAGC AGGGGATTTT 



480 
540 
600 
660 
720 
780 
340 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



60 
. 120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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\l/rt Oft/1 7*701 
W \J yofl I loo 

A7ACATAAAA 


AAAAAAAAAA 


AAAAAAAACC 


AAACATTACA 




PCT/US97/19541 

ACTATATCCA 72 0 


ATGAAACAAT 


CACAGATACC 


AACCAAAACC 


ATTGGGCAGA 




AAAATCATCC 


780 


AAGCATTACA 


ACTCTGAAAC 


CATGTATTCA 


GTGAATCACA 


a a pap a ap ap 


GTAACATCAT 


840 


TATTAA7ATA 


AATCACCAAA 


ATATAACAAT 


TCTAAAAGGA 


TGTATPPAP,'" 


GGGGGCTGTC 


900 


GTCCAAGGCT 


AGTGGCAGAT 


TTCCAGGAGC 


AGGTTAGTAA 




CTGAACTAAC 


960 


TCTCCAGCCC 


CATGGTCAAT 


TATTATTTAG 


CATCTAGTGC 


CTAATTTTTT 


TTTATAAATC 


1020 


TTCACTATGT 


AATTTAAAAC 


TATTTTAATT 


CTTCCTAATT 


AAnnr^TTrT 

"rtUUV, i 1 1 \_ 1 


TTACCATATA 


1080 


CCAAAATTCA 


GCTCCAATGA 


CACACGGGTA 


GCCATATGAA 


ATTTT ATTG ^ 


TGGGAAAATT 


1140 


TGTAGCTATC 


ATAATAGTTT 


TGTAAATGAT 


TTAAAAAGCA 


AAGTGTTAGP 


CGGGCGTGGT 


1200 


GGCACACGCC 


TTTAATCCCT 


GCACTCGGGA 


GGCAGGGGCA 


GGAGGATTTr 


TGAGTTTGAG 


1260 


GCCAGCCTGG 


TCTACAGAGT 


GAGTTCCAGG 


ACAGCCAGGG 


CTACACAGAG 


AAACCCTGTC 


1320 


TCGAACCCCC 


CACCCCCCAA 


AAAAAGCAAA 


GTGTTGGTTT 




AAAGTCATGT 


1380 


TAGTGGCCCA 


TGTCTAGGCC 


CATCTGACCC 


ATTATTCTCG 


PTTA AH nTPT 


TGGCCTAGGC 


1440 


TACCAGGAAC 


ATGTAAATAA 


GAAAAGGAAT 


AAGAGAAAAC 




GATTGGCATG 


1500 


AGAACTACGG 


GTCAATATTT 


TTTCTCTCCG 


G CG AAGAGTT 


PPAPA arraT 


CTGCAGGAGG 


1560 


CCTCGACGTT 


TTGAGGTCAA 


TGGCCTCAGT 


CTGTGGAACT 


i 1 V rt\_ rto 


ATCTTAGTGG 


1620 


AGGTGGTGTG 


GGAGAAACCC 


ATTCCTTTTA 


GTGTCTTGGG 


PTA A A A PTA A 


AAGGCGCAGA 


1680 


GGAGGCCTTT 


GCTCATCTGA 


CCATGCTGAC 


AAGGAACACG 




CAGAGGCTGG 


1740 


ACCCCAGGAA 


GACCTTAAAC 


ACTTCTTCCC 


TTCTCCGCPP 


v_ i M.\jM.Vj V_ M.VJ 


GCTCCCGTCA 


1800 


CCAGCCTGGG 


GAGAAATGGG 


GGAAGATGGA 


GTGAAGCCAT 


A PTnnrT a 


CCAGAATCAA 


1860 


CAGAGGGAGC 


CGGGGGCAAT 


ACTGGAGAAG 


CTGGTCTPPP 


v_ rtUO O VJ M 


ATCCTGGCAC 


1920 


CTCCCAGGCA 


GAAGAGGAAA 


CTTCCACAGT 


GCATCTCACT 


tppatp a aTr 


CCCTCCTCGG 


1980 


ACTCTGAGGT 


GCTTGGTCAC 


AGGTGAGGTG 


CAAAAGGCTC 


PTHTPATATT 


GTGTCCTGCT 


2040 


CTGGTCTGCG 


TTCCACAGCT 


TGGGGGGCAC 


CTAGCCCACC 


TCTPPPTAGG 


GATGAGAGCA 


2100 


GCCACTAGGG 


GTCTAGGCTG 


CCCATGTAAG 


GAGGCAAGGC 




CGGAGATGCC 


2160 


TGGTTATAAT 


TAACCCAGAC 


ATGTGGCTGC 


cccccccccc 


PPAAPAPPTP. 


CTGCCTGAGC 


2220 


CTCACCCCCA 


CCCCGGTGCC 


TGGGTCTTAG 


GCTCTGTACA 


PPATPPAtin A 


GAAGCTCGCT 


2280 


CTAAAAATAA 


CCCTGTCCCT 


GGTGGATCCA 


GGGTGAGGGG 


CAGGPTGAGn 


GCGGCCACTT 


2340 


CCCTGAGCCG 


CAGGTTTGTT 


TTCCCAAGAA 


TGGTTTTTPT 


PPTTPTPTar 1 


CTTTTCCTGT 


2400 


CAATTCTGGC 


ATGGTGGAGC 


AGCCTGCACT 


GGGCTTCTGG 


GAGA AAPPA A 


ACCGGGTTCT 


2460 


AACCTTTCAG 


CTACAGTTAT 


TGCCTTTCCT 


GTAGATGGGC 


GACTAPAGPP 


CCACCCCCAC 


2520 


CCCCGTCTCC 


TGTATCCTTG 


CTGGGCCTGG 


GGATCCTAGG 


PTTTP A PTPP 


AAATTTGCCC 


2580 


CCAGGTGCTG 


TAGGCTAGAG 


TCACGGCTCC 


CAAGAACAGT 




CATGCATGGT 


2640 


TCTGAACCTC 


pa aptp.pa a a 


AAA I UAL ACA 


TACCTTGACC 


CTTGGAAGGC 


TGAGGCAGGG 


2700 


GGATTGCCAT 


GAGTGCAAAG 


CCAGACTGGG 


TGGCATAGTT 


AGACCCTGTC 


TCAAAAAACC 


2760 
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AAAAACAA7T 


AAATAACTAA 


AGTCAGGCAA 


GTAATCCTAG 


TCGGGAGACT 


GAGGCAGAGG 2 820 


GATTGTTACA 


TGTCTGAGGC 


CAGCCTGGAC 


TACATAGGGT 


TTCAGGCTAG 


CCCTGTCTAC 


2380 


AGAGTAAGGC 


CCTATTTCAA 


AAACACAAAC 


AAAATGGTTC 


TCCCAGCTGC 


TAATGCTCAC 


2940 


CAGGCAATGA 


AGCCTGGTGA 


GCATTAGCAA 


TGAAGGCAAT 


GAAGGAGGGT 


GCTGGCTACA 


3000 


ATCAAGGCTG 


TGGGGGACTG 


AGGGCAGGCT 


GTAACAGGCT 


TGGGGGCCAG 


GGCTTATACG 


3060 


TGCCTGGGAC 


TCCCAAAGTA 


TTACTGTTCC 


ATGTTCCCGG 


GGAAGGGCCA 


GCTGTCCCCC 


3120 


GCCAGCTAGA 


CTCAGCACTT 


AGTTTAGGAA 


CCAGTGAGCA 


AGTCAGCCCT 


TGGGGCAGCC 


3180 


CATACAAGGC 


CATGGGGCTG 


GGCAAGGTGC 


ACGCCTGGGT 


CCGGGGTGGG 


CACGGTGCCC 


3240 


GGGCAACGAG 


CTGAAAGCTC 


ATCTGCTCTC 


AGGGGCCCCT 


CCGTGGGGAC 


AGCCCCTCCT 


3300 


GGCTAGTCAC 


ACCCTGTAGG 


CTCCTCTATA 


TAACCCAGGG 


GCACAGGGGC 


TGCGCCGGGG 


3360 


TCAC 












3364 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = •• DNA " 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGTACCACTA CGGGTCTAGG CTGCCCATGT AAGGAGGCAA GGCCTGGGGA CACCCGAGAT 
GCCTGGTTAT AATTAACCCC AACACCTGCT GCCCCCCCCC CCCCAACACC TGCTGCCTGA 
GCCTGAGCGG TTACCCCACC CCGGTGCCTG GGTCTTAGGC TCTGTACACC ATGG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

di) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

AGATCTTCCT AGAAATTCGC TGTCTGCGAG GGCCGGCTGT TGGGGTGAGT ACTCCCTCTC 

AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 

ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCG TCCATCTGGT CAGAAAAGAC 

AATCTTTTTG TTGTCAAGCT TGAGGTGTGG CAGGCTTGAG ATCGATCTGG CCATACACTT 

GAGTGACAAT GACATCCACT TTGCCTTTCT CTCCACAGGT GTCCACTCCC AGGTCCAACC 

GCGGATCTCC CGGGACCATG CCCAAGAAGA AGAGGAAGGT GTCCAATTTA CTGACCGTAC 

ACCAAAATTT GCCTGCATTA CCGGTCGATG CAACGAGTGA TGAGGTTCGC AAGAACCTGA 

TGGACATGTT CAGGGATCGC CAGGCGTTTT CTGAGCATAC CTGGAAAATG CTTCTGTCCG 
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TTTGCCGGTC GTGGGCGGCA TGGTGCAAGT TGAATAACCG GAAATGGTTT CCCGCAGAAC 54 0 

CTGAAGATGT TCGCGATTAT CTTCTATATC TTCAGGCGCG CGGTCTGGCA GTAAAAACTA 6 00 

TCCAGCAACA TTTGGGCCAG CTAAACATGC TTCATCGTCG GTCCGGGCTG CCACGACCAA 66 0 

GTGACAGCAA TGCTGTTTCA CTGGTTATGC GGCGGATCC 6 99 

(2) INFORMATION FOR SEQ ID NO: 12: 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ocher nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(Xi)- SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GAAGATCTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATTACCGAAG AAATGGCTCG 6 0 

AGATCTTC 

(2) INFORMATION FOR SEQ ID NO: 13: 



68 



ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 

GAAGATCTCG AGCCATTTCT TCGGTAATAA CTTCGTATAG CATACATTAT ACGAAGTTAT 6 0 

AGATCTTC 6 g 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CCACATGTAT AACTTCGTAT AG CAT AC ATT ATACGAAGTT ATACATGTGG 50 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCACATGTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATACATGTGG 50 
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' • A recombinant plasmid. c„ mpri si„ s in operable 

antibio^ " Pl3Smid baCkb0ne ' C ° mPnSinS °" **" ° f • « 

an„b,o„c reststance gene and a eukaryotic promoler dm 

") <he left and right inverted terminal repeats (ITRs) of 
adenovirus, said ,TRs each havi„ g a ,■ and a y end and arranged ,n a tai, 
io ta,l onentation on said plasmid backbone: 

«> 'he adenovirus packaging sequence, said P ac kae i„ a sequence 
'•avmg a , and a y end and linked to one of said ,TRs : and ' 

0) a first gene of interest operably linked ,„ sa id prom o,er 

element. 



The recombinant piasm.d of Claim . wherein the total size of said 



plasmid is between 27 and 40 kilobase 



pairs. 



3. The recombman, plasmid of Claim I. wherein said y end of said 
packagtng sequence is l.nked to said 3' end of said left ITR. 

4 The recombinant plasmid of Claim 3. wherein said firs, «ene of 
-eres, is linked to said 3' end of said packaging sequence. 

5. The recombinant plasmid of Claim 4. where.n said firs, 2 e„e of 
■meres, is the dystrophin cDNA gene. 

6 The recombinant plasmtd of Claim 4. further comprising a second 

gene ot interest. 



I The recombinant plasmid of Claim 6. wherein said second K „e of 
■meres, ,s linked to said 3' end of said right ITR. 

8- The recombinant plasm.d of Cla.m 7. wherein said second e e„e of 
interest is a reporter aene. 
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9. The recombinant plasmid of Claim 8. wherein said reporter gene is 
selected from the group consisting of the £ coli (3-galactosidase gene, the hi 
placental alkaline phosphatase gene, the green Huroescent protein gene and the 
choramphenicol acetyltransferase eene. 



luman 



10. 
Claim I. 



A mammalian cell line containing the recombinant plasmid of 



The cell line of Claim 10, wherein said cell line is a 293-derived cell 



line. 



12. A helper adenovirus comprising i) first and a second loxP sequences. 
10 and ii) the adenovirus packaging sequence, said packaging sequence havine a 5" 

and a 3* end. 

13. The helper adenovirus of Claim 12, wherein said first loxP sequence 
is linked to the 5* end of said packaging sequence and said second loxP sequence is 
linked to said 3" end of said packaging sequence. 

15 14 - The helper adenovirus of Claim 12. further comprising at least one 

adenovirus gene coding region. 

15. A mammalian cell line, comprising: 

a) a recombinant plasmid. comprising, in operable combination: 

i) a plasmid backbone, comprising an origin of 

-° replication, an antibiotic resistance gene and a eukaryotic promoter 

element, 

ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3* end and arranged in 
a tail to tail orientation on said plasmid backbone. 

iii) the adenovirus packaging sequence, said packaging 
sequence having a 5* and a 3* end and linked to one of said ITRs. 
and 
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iv) a first gene of interest operabiy linked to said 
promoter element: and 

b) a helper adenovirus comprising i) first and a second loxP 
sequences, and ii) the adenovirus packaging sequence, said packaging 
sequence having a 5" and a 3' end. 

16. The cell line of Claim 15. wherein said helper adenovirus further 
comprises at least one adenovirus gene coding region. 

17. The cell line of Claim 15. wherein said first gene of interest is the 
dystrophin cDNA gene 

10 1 8. A method of producing an adenovirus minichromosome. comprising: 

A) providing a mammalian cell line containing: 

a) a recombinant plasmid. comprising, in operable 
combination, i) a plasmid backbone, comprising an origin of 
replication, an antibiotic resistance gene and a eukarotic promoter 
element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5" and a 3" end and arranged in 
a tail to tail orientation on said plasmid backbone, iii) the adenovirus 
packaging sequence, said packaging sequence having a 5' and a 3' 
end and linked to one of said ITRs. and iv) a first gene of interest 
operabiy linked to said promoter element: and 

b) a helper adenovirus comprising i) first and a second 
loxP sequences, ii) at least one adenovirus gene coding region, and 
iii) the adenovirus packaging sequence, said packaging sequence 
having a 5* and a 3" end: and 

B) growing said cell line under conditions such that said 
adenovirus gene coding region is expressed and said recombinant plasmid 
directs the production of at least one adenoviral minichromosome. 

1 9. The method of Claim 1 8. wherein said adenovirus minichromosome 
is encapidated. 



15 



20 
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20. The method of Claim 18. further comprising 3) recovering said 
encapsidated adenovirus minichromosome. 

21. The method of Claim 20. further comprising 4) purifiving said 
recovered encapsidated adenovirus minichromosome. 

5 22. The method of Claim 21. further comprisng 5) administering said 

purified encapsidated adenovirus minichromosome to a host. 

23. The method of Claim 22. wherein said host is a mammal. 

24. The method of Claim 23, wherein said mammal is a human. 

25. A recombinant adenovirus comprising the adenovirus E2b region 
10 having a deletion, said adenovirus capable of self-propagation in a packaging cell 

line and said E2b region comprising the DNA polymerase <:ene and the adenovirus 
preterminal protein gene. 

26. The recombinant adenovirus of Claim 25. wherein said deletion is 
within the adenovirus DNA polymerase gene. 

'5 27. The recombinant adenovirus of Claim 25. wherein said deletion is 

within the adenovirus preterminal protein gene. 

28. The recombinant adenovirus of Claim 25. wherein said deletion is 
within the adenovirus DNA polymerase and preterminal protein genes. 

29. A mammalian cell line stably and constitutively expressing the 
20 adenovirus El gene products and the adenovirus DNA polymerase. 

30. The cell line of Claim 29. wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 
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3 1 The ceil line of Claim 30. wherein said deletion within said E2b 
region comprises a deletion within the adenoviral DNA polymerase gene. " 

32. The cell of Claim 29. wherein the genome of said cell line contains 
a nucleotide sequence encoding adenov.rus DNA polymerase operably linked to a 

5 heterologous promoter. 



33. The cell line of Claim 32. wherein sa.d cell line is selected from the 
group consisting of the B-6. B-9. C-l. C-4. C-7. C-13. and C-14 cell lines. 

34. The cell line of Claim 29 further constitutively expressing the 
adenovirus preterminal protein gene product. 

35. The cell line of Claim 34. wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 

36. The cell line of Claim 35. wherein said deletion within said E2b 
mprises a deletion within the adenoviral preterminal protein gene. 



reeion co 



'5 37. The ceil line of Claim 36. wherein said deletion within said E2b 

region comprises a deletion within the adenoviral DNA polymerase and preterminal 



protein genes. 



38. .... The cell line of Claim 34. wherein the genome of said cell line 
contains a nucleotide sequence encoding adenovirus preterminal protein operably 

20 linked to a heterologous promoter. 

39. The cell line of Claim 38. wherein said cell line is selected from the 
group consisting of the C-l. C-4. C-7. C-13. and C-14 cell lines. 
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40. A method of producing infectious recombinant adenovirus particles 
containing an adenoviral genome containing a deletion within the E2b region, 
comprising 

a) providing: 

3 i) a mammalian cell line stably and constitutivelv 

expressing the adenovirus El gene products and the adenovirus DNA 
polymerase; 

ii) a recombinant adenovirus comprising a deletion within 
the E2b region, said recombinant adenovirus being capable of self- 
10 propagation in said cell line; 

b) introducing said recombinant adenovirus into said cell line 
under conditions such that said recombinant adenovirus is propacated to 
form infectious recombinant adenovirus particles; and 

c) recovering said infectious recombinant adenovirus particles. 

15 41. The method of Claim 40, further comprising d) purifying said 

recovered infectious recombinant adenovirus particles. 

42. The method of Claim 41, further comprising e) administering said 
purified recombinant adenovirus particles to a host. 

43. The method of Claim 40, wherein said mammalian cell line further 
20 constitutivelv expresses the adenovirus preterminal protein. 

44. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said E2b sequences containing a deletion 
within the polymerase gene, said deletion resulting in reduced polymerase activity. 

45. . The recombinant plasmid of Claim 44. wherein said deletion 
25 comprises a deletion of nucleotides 8772 to 9385 in SEQ ID NO:4. 

46. The recombinant plasmid of Claim 45. wherein said plasmid has the 
designation pApol. 
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47. The recombinant plasmid of Claim 45. wherein said plasmid has the 
designation pBHGI lApol. 



48. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said E2b sequences containing a deletion 

5 within the preterminal protein gene, said deletion resulting in the inability IO 

express functional preterminal protein without disruption of the VA RN'A aenes. 

49. The recombinant plasmid of Claim 48. wherein said deletion within 
said preterminal protein gene comprises a deletion of nucleotides 10.705 to 1 1.134 
in SEQ ID NO:4. 

10 50. The recombinant plasmid of Claim 49. wherein said plasmid has the 

designation pApTP. 

51. The recombinant plasmid of Claim 49, wherein said plasmid has the 
designation pBHGI lApTP. 

52. The recombinant plasmid of Claim 48 further comprising a deletion 
1 5 within the polymerase gene, said deletion resulting in reduced polymerase activity. 

53. The recombinant plasmid of Claim 52. wherein said deletion within 
said polymerase and said preterminal protein genes comprises a deletion of 
nucleotides 8.773 to 9586 and 1 1.067 to 12.513 in SEQ ID NO:4. 

54. The recombinant plasmid of Claim 53. wherein said plasmid has the 
20 designation pAXBApolApTPVARNA+H3. 



55. The recombinant plasmid of Claim 53. wherein said plasmid has the 
designation pBHGI 1 ApolApTPVARNA+tl3. 
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